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1. 


INTRODUCTION 


This  final  report  covers  a  five-year  program  conducted  by  SRI  International  (SRI)  for  the 
Office  of  Naval  Research.  The  focus  of  this  program  has  changed  over  its  five-year  duration. 
The  original  focus  was  on  the  theory  of  energy-assisted  epitaxy,  a  continuation  of  the  previous 
contract.  Our  focus  then  shifted  to  the  properties  of  ordered  semiconductors  grown  by  epitaxial 
methods.  Finally,  in  the  last  year  of  the  program,  our  focus  shifted  toward  the  development  of  a 
new  material  we  had  predicted,  Ini_xTlxSb. 

Because  of  the  volume  of  material  covered  in  this  program,  we  have  organized  this  final 
report  as  a  brief  summary  of  the  work,  with  the  contract-supported  published  papers  presented  as 
an  appendix.  Because  much  of  the  work  on  the  last  topic  studied,  InTlSb,  has  not  been 
published,  we  include  a  section  on  this  topic. 


2.  THEORY  OF  ENERGY-ASSISTED  EPITAXY  AND  THE  THEORY  OF 
ORDERED  SEMICONDUCTOR  ALLOYS 


The  primary  goal  of  this  portion  of  the  program  was  to  understand  energy-assisted 
epitaxial  growth  of  semiconductors  and  to  predict  and  study  the  properties  of  ordered 
semiconductors  that  might  result  Subtasks  include  the  study  of  surface  energetics  including  the 
effects  of  surface  charge  transfer,  the  development  of  an  epitaxial  growth  model  including  the 
effects  of  energy  assist;  the  prediction  of  the  ordered  alloys  that  may  result  from  the 
nonequilibrium  growth;  and  the  study  of  the  properties,  including  stability,  of  ordered  materials 
grown  by  nonequilibrium  methods.  A  very  brief  summary  of  this  work  and  major  conclusions 
^awn  from  it  are  given  here. 

The  development  of  a  growth  model  requires  a  reliable  estimate  of  the  interaction  energy 
between  constituent  atoms  with  various  local  environments.  Once  these  energies  are  known, 
epitaxial  growth  can  be  modeled  using  appropriate  statistics.  We  have  used  the  calculated  pair 
interaction  energies  in  a  quasichemical  approximation  to  study  the  qualitative  nature  of 
semiconductor  surfaces.  The  effect  of  energy  assist  has  been  modeled  through  the  consequent 
change  in  surface  entropy. 

In  this  program  we  have  shown  that  charge  transfer  between  surface  cations  and  anions 
substantially  affects  the  surface  atom  interaction  energies  and  consequently  the  nature  of  the 
epitaxially  grown  surfaces.  Relaxation  of  surface  atoms  is  found  to  reduce  the  surface  energies 
by  of  the  order  of  an  electron  volt  We  have  also  developed  models  of  the  surfaces  during  atomic 
layer  epitaxy  (ALE)  and  molecular  beam  epitaxy  (MBE)  growth,  using  the  calculated  pair 
energies  in  a  quasi-chemical  approximation. 
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An  observed  difference  in  the  growth  of  HgCdTe  on  the  (1 1 1)  A  and  B  surfaces,  which 
cannot  be  explained  using  simple  bond-breaking  energy  arguments,  has  been  explained  using  a 
growth  model  based  on  our  surface  binding  energies.  A  varient  of  the  generalized  penurbation 
method  (GPM)  was  developed  to  obtain  cluster  energies  in  semiconductor  alloys. 

Under  this  program  we  have  also  developed  a  model  of  segregation  at  the  free  surfaces 
and  interfaces  of  semiconductor  alloys.  This  model  has  been  applied  to  a  number  of  systems, 
including  HgCdTe  and  HgZnTe.  We  have  also  developed  phase  diagram  models  using  our 
quasichemical  formalism  and  applied  it  to  several  systems. 

Several  properties,  such  as  the  elastic  constants  and  cohesive  energies  of  the  semi¬ 
conductors,  are  necessary  input  for  the  growtli  models  we  have  developed.  Thus  some  effort 
under  this  program  was  expended  on  refining  some  tight-binding  models  of  these  properties  to 
provide  more  accurate  and  predictive  models  on  which  the  growth  models  were  based.  We  have 
also  used  these  models  to  study  native  point  defects  in  the  semiconductor  alloys. 

A  linear  combination  of  atomic  orbitals  (LCAO)  Hamiltonian  was  developed  to  apply  to 
the  growth  problem  in  the  semiconctor  alloys.  While  no  papers  were  published  on  this  work 
because  our  funding  was  depleted  before  the  work  could  be  completed,  a  preliminary  paper*  (see 
Appendix)  on  this  work  was  written  and  applied  to  the  ordering  in  GaAlAs,  which  we  find  to  be 
driven  by  surface  interactions. 

The  important  conclusions  we  have  drawn  from  our  work  are: 

•  Surface  sublimation  and  binding  energies  needed  in  growth  models  are  surface- 
orientation  dependent 

•  Surface  sublimation  energies  are  not  simply  proportional  to  the  number  of  first 
neighbor  bonds  made  to  the  surface 

•  Charge  transfer  among  surface  atoms  is  a  large  component  of  surface  sublimation 
energies  and  will  be  most  important  in  wide  bandgap  materials 

•  Charge  transfer  among  surface  atoms  can  lead  to  ordering  of  the  semiconductor 
surface  below  some  critical  temperature,  Tc 

•  A  simple  growth  model  incorporating  our  surface-dependent  binding  and  sublimation 
energies  can  explain  the  difference  in  the  growth  rate  of  HgCdTe  on  the  (1 1 1)A  and 
(lll)B  surfaces 

•  A  multilayer  growth  model  was  developed  and  applied  to  HgTe,  CdTe,  Si,  and  GaAs 
to  determine  regimes  in  which  layer-by-layer  and  three-dimensional  growth  occurs 

•  Ordering  in  semiconductor  alloys  at  the  growth  temperature  is  found  to  be  metastable 
in  all  systems  studied 

•  Surface  segregation  is  surface-orientation-dependent 


*  S.  Krishnamurthy  and  A.  Sher,  “Ordering  in  GaAlAs.” 
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•  Long-range  alloy  compositional  oscillation  away  from  a  surface  can  occur  metastably 
below  some  critial  temperature,  and  its  formation  is  driven  by  the  same  m^hanisms 
which  drives  surface  segregation 

•  Vacancy  formation  energies  in  pseudobinary  semiconductor  alloys  are  very  sensitive 
to  the  alloy  environment 

•  Ordering  in  GaAlAs  is  driven  by  surface  Coulomb  interactions. 


3.  lni_xTlxSb  AS  A  LONG-WAVE  INFRARED  MATERIAL 


We  demonstrated,  based  on  full-potential  linearized  muffin  tin  orbital  (FP-LMTO)  based 
self-consistent  solutions  to  the  Schrodinger  equation  with  the  exchange  term  expressed  in  the 
local  density  approximation  (LDA),  that  TlSb  in  the  zincblende  structure  has  roughly  the  same 
bond  length  (~  2%  smaller)  as  that  of  InSb,  that  it  is  a  semimetal,  and  that  it  has  a  large  cohesive 
energy  comparable  to  that  of  InSb.  We  also  demonstrated  that  TlSb  minimum  energy  is  attained 
in  the  CsQ  structure.  However,  for  x  <  0.15,  the  zincblende  structured  alloy  was  predicted  to  be 
stable  for  attainable  growth  temperatures.  We  also  predicted  that  at  about  x  =  0.09  the  band  gap 
of  the  alloy  would  be  ~  0.1  eV,  the  value  needed  for  long- wave  infrared  (LWIR)  focal  plane 
arrays  (FPA).  Because  for  the  same  band  gap,  0.1  eV,  the  average  bond  energy  of  Ino.9iTlo,9Sb 
is  -  1.43  eV,  compared  to  Hgo.78Cdo,22Te  at  ~  0.88  eV,  we  suggested  the  InTlSb  alloy  may  be  a 
more  robust  mate^  for  FPAs. 

Several  experimental  groups  are  trying  to  grow  the  material,  and  one  at  Northwestern 
claims  to  have  done  so.*  There  are  aspects  of  the  Northwestern  work  that  are  suspect,  and  while 
we  would  like  to  believe  they  have  prepared  the  material,  we  recommend  judgment  be  withheld 
until  further  tests  are  performed  on  their  material. 


4.  COMMENTS  ABOUT  FPA  SYSTEM  ISSUES:  WHAT  ARE  THE 
SYSTEM  MERITS  OF  lni_xTlxSb 


There  has  been  a  heated  debate  about  the  relative  importance  of  individual  pixel 
detectivity  Qugh  quantum  efficiency  and  low  noise)  and  the  spatial  uniformity  of  tlus  detectivity 
over  an  array.  Some  types  of  nonuniformity  are  correctable  in  the  output  circuitry;  others  are 
not  A  theory  says  that  if  you  are  observing  a  high-background-temperature  scene,  then  quantum 
efficiency  is  less  important  to  the  noise  equivalent  contrast  temperature  (NEAT)  (related  to  the 
minimum  resolvable  temperature  in  a  scene)  than  the  spatial  nonuniformity.  However,  for  low- 

*  Y  Ji.  Qkm,  C.  Besikci,  R.  Sudharsanan,  and  M.  Razeghi,  "Growth  of  Ini_xTlxSb,  a  new  infrared  material,  by 
low-pressure  metaloiganic  chemical  vapor  deposition,”  ^pl.  Phys.  Lett.  63, 361  (1993). 


background-temperature  scenes,  the  reverse  is  true.  The  arguments  center  around  the  critical 
scene  temperature  Tcs  that  separates  these  regions  and  the  kinds  of  nonuniformities  it  is  practical 
to  correct.  Some  people  contend  that  Tcs  lies  below  0°C,  so  only  Arctic  and  space  observations 
require  high  quantum  efficiencies.  Others  place  Tcs  well  above  all  typical  earth  backgrounds,  so 
a  high  quantum  efficiency  is  always  beneficial.  From  what  we  have  seen  from  cameras  made 
from  different  materials,  we  believe  the  high-quantum-efficiency  supporters  are  right;  besides 
that,  it  is  logical. 

One  important  source  of  incorrectable  nonuniformity  is  a  quantum  efficiency  that  has  a 
sharp  spectral  dependence  beyond  its  cut-on.  Thus,  a  flat  quantum  efficiency  is  a  distinct 
advantage. 

Being  able  to  build  the  signal  processing  circuit  on  the  same  chip  as  the  detector  array  is 
also  highly  desirable.  This  is  not  possible  at  present  for  HgCdTe,  so  the  FPA  is  In  bump  bonded 
to  a  Si  chip  that  contains  the  readout  circuits.  The  thermal  expansion  mismatch  between  these 
dissimilar  materials  limits  the  size  of  practical  FPAs  made  this  way.  If  larger  arrays  are  needed, 
they  must  be  assembled  in  a  mosaic  pattern,  which  limits  the  picture  quality  and  adds  complexity 
to  the  system.  This  is  why  there  is  so  much  effort  these  days  devoted  to  growing  HgCdTe  on  Si 
substrates.  The  problem  there  is  that  the  lattice  constant  difference  is  -19%,  so  dislocations  are 
plentiful,  which  again  degrades  device  performance. 

There  are  many  other  system  issues,  but  this  minimal  set  suffices  to  set  the  stage  for  the 
major  arguments  surrounding  the  relative  merits  of  LWIR  alloy  detectors  like  HgCdTe  and 
InTlSb,  quantum  well  detectors,  and  strained  layer  superlattice  detectors. 

HgCdTe  256  x  256  pixel  arrays  have  been  built  with  NEAT  =  6  mK.  This  is  more  than 
adequate  for  many  applications,  but  it  is  hard  to  do  with  high  yield,  so  these  arrays  are  expensive. 
Some  strategic  applications  require  much  larger  arrays.  The  quantum  efficiency  of  HgCdTe  is 
-70%,  and  spectrally  flat,  so  corrections  to  spatial  nonuniformity  are  reliable  over  a  large 
dynamic  range.  CdZnTe  currently  is  the  best  substrate  material.  It  is  good,  but  not  superb,  since 
the  sizes  that  can  be  prepared  are  limited,  it  has  moderate  dislocation  densities,  and  some 
troublesome  impurities  tend  to  migrate  into  the  active  material.  Some  of  these  properties  are 
being  fixed. 

The  quantum  well  and  strained  layer  superlattice  detectors  are  basically  built  on  GaAs 
substrates.  Thus,  the  signal  processing  can  be  done  on  the  chip  that  holds  the  array.  However, 
both  materials  have  low  quantum  efficiencies  (<  10%),  and  they  have  sharp  spectral  variations. 
The  supporters  of  these  devices  contend  that  because  they  are  constructed  from  IIT-V  rather  than 
II- VI  material,  their  relative  robustness  will  lead  to  good  spatial  uniformity  (limiting  the  need  for 
corrections),  few  dislocations,  and  generally  high-quality  material.  Reasonable  performance  has 
been  demonstrated  on  64  x  64  arrays,  which  is  quite  good  for  a  newly  evolved  technology. 
However,  they  were  not  as  good  as  current  similar  HgCdTe  arrays.  Thus,  quantum  well  and 
superlattice  supporters  invoke  expectations  based  on  intuition  and  how  well  devices  have  worked 
in  view  of  the  immaturity  of  the  technology.  They  are  saying,  “bet  on  what  you  don’t  know 
about,  rather  than  spend  your  effort  to  fix  the  troubles  you  have  encountered  in  HgCdTe.”  They 
have  a  firm,  legitimate  claim  to  better  substrates  compatible  with  readout  circuits. 


4 


MWIR  InSb  detectors  arrays  are  slightly  better  than  MWIR  Hgo.7Cdo.3Te  detectors,  but 
the  two  are  close  competitors.  LWIR  Hgo.7Cdo.3Te  is  a  much  more  difficult  material,  for  a  wide 
variety  of  reasons,  but  much  of  the  trouble  can  be  traced  back  to  the  fact  that  the  material  is 
70%  HgTe  (bond  strength  0.81  eV/bond)  and  30%  CdTe  (1.1  eV/bond). 

Ino.91Tlo.9Sb  should  have  optical  and  electronic  properties  similar  to  those  of 
Hgo.78Cdo.22Te,  but  it  is  91%  InSb  (1.43  eV/bond)  and  only  9%  TlSb  (1.34  eV/bond).  The 
constituents’  bond  length  difference  in  InTlSb  is  not  as  small  as  that  for  HgCdTe,  but  it  is  only 
~2% — still  a  near-lattice-matched  alloy.  The  most  important  feature  is  that  the  average  bond 
strength  is  much  larger,  which  will  cause  native-point  defect  populations  and  other  imperfection 
densities  to  be  much  smaller  and  less  troublesome. 

The  hope  for  a  fast  utilization  of  lno.91Tlo.09Sb,  once  it  is  grown,  is  that  it  will  be 
essentially  InSb-like  in  its  defect  properties.  If  this  is  the  case,  then  it  will  be  possible  with  only 
slight  modifications  to  substitute  InTlSb  epitaxial  layers  on  InSb  substrates  into  an  InSb  FPA 
production  line,  use  essentially  the  same  readout  circuitry,  and  end  up  with  an  LWIR  response 
rather  than  MWIR.  If  the  LWIR  characteristics  are  as  good  as  we  project,  then  a  sujjerior 
(cheaper)  system  is  the  result.  We  will  have  to  wait  to  see. 
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A  cluster  theory  based  on  the  quasichemical  approximation  has  been  applied  to  study  the  local 
correlation,  bond-length  distribution,  and  phase  diagrams  of  the  Il-VI  pseudobinary  alloys 
Hgi  .  ,Cd,Te.  Hg,  ,Zn,Te.  and  Cd,  _  ,Zn,Te.  The  cluster  energy  is  calculated  by  letting  it 
relax  in  some  effective  alloy  medium  and  then  considering  the  contributions  from  the  strain  and 
chemical  energies.  Two  different  models  are  presented  to  simulate  the  alloy  medium.  While  both 
models  show  that  all  three  alloys  have  nearly  random  distributions,  the  signs  of  the  local 
correlation  prove  to  be  sensitive  to  the  alloy  medium  chosen  for  the  energy  calculation.  Good 
agreement  is  found  betw  een  experiment  and  the  bond  lengths  and  phase  diagrams  in  both  models. 


I.  INTRODUCTION 

This  paper  is  a  summary  of  our  theoretical  investigations'"* 
into  the  statistical  properties  of  Hg,  _,Cd,Te. 
Hg,  ,Zn,Te.  and  Cd,  _  ,Zn,Te  alloys.  A  primary  objective 
of  these  studies  is  to  correlate  microscopic  quantities  such  as 
bond  lengths  and  atomic  distributions  to  macroscopic  quan¬ 
tities  such  as  internal  strain  energy,  phase  diagrams,  and 
mechanical  propenies  of  the  alloy  (e  g.,  bulk  modulus). 

The  theory  is  based  on  a  generalized  quasichemical  ap¬ 
proximation  (GQCA),'  where  the  usual  pair  interaction  is 
replaced  by  a  microcluster  unit  of  arbitrary  size.  This  entails 
the  calculation  of  cluster  energies  in  some  effective  medium 
along  with  a  statistical  formulation.  Once  the  cluster  ener¬ 
gies  are  available,  the  cluster  populations  can  be  computed, 
which  in  turn  yields  the  mixing  free  energy  and  other  quanti- 
lies  mentioned  above. 

The  next  section  will  present  a  summary  of  the  statistical 
theory.  ( Full  details  are  provided  in  Ref  1. )  Section  III  de¬ 
scribes  the  calculation  of  cluster  energies  along  with  a  dis¬ 
cussion  of  effective  alloy  media  in  which  the  clusters  are 
placed.  Section  IV  will  present  results  of  some  quantities 
calculated  with  the  theory.  Section  V  contains  conclusions 
and  discussions. 

II.  GQCA  STATISTICAL  THEORY 

For  a  zinc-blende  alloy  of  the  form  A,  _  .B.C  at  a  given 
temperature  T,  the  mixing  free  energy  AF  in  the  GQCA  is 
given  as 


^F=^E-T^S=  -TAS.  (1) 


where  .V/^  is  the  number  of  clusters  in  the  alloy  with  excess 
cluster  energy  and  5  is  the  configurational  entropy. 

A5  =  k  In  g  .  (2) 


where  g  is  given  as 


g  = 


•V 


M' 


n(< 


UK'  ' 


(3) 


In  Eq.  (3).  arc  the  number  of  A(B)  atoms  in  the  alloy 
with  .V  =  .\\  -n  Sb.x°  is  the  a  priori  fraction  of  clusters  of 
the  j  type  ( random  distribution ).  and  SI  is  the  total  number 
of  clusters  in  the  alloy  ( e.g..  M  =1  Sf^).  Using  Eqs.  ( 2 )  and 
(3).  Eq.  ( 1 )  can  be  rewritten  as 


AF=  Sf 


y  e  Xj  +  kT^Xi{\n  x/xj) 


+  .VA.T[jt  In  Jc  -r  ( 1  —  .t)ln(  I  -  x) )  . 


(4) 


where  x^  =  Sf^/Sf  is  the  fraction  of  clusters  with  energy  e 
The  equilibnum  values  of  the  probabilities  {x  }  are  ob¬ 
tained  by  minimizing  AFwith  respect  tox^  while  obeying  the 
following  constraints  on  the  alloy  composition 


y  x,  =  1  . 


(5) 


y  n,  ( B  )x^  =  nx  . 

where  n  is  the  number  of  alloying  atoms  (A  and  B)  per 
cluster.  The  result  of  a  variational  calculation  (equivalent  to 
a  steepest  descent  argument  given  in  Ref  1 )  of  AFin  Eq,  (4) 
with  respect  to  Xj  subject  to  the  constraints  of  Eq.  ( 5 )  leads 
to  the  following/th  order  equation  for  an  effective  activity 
coefficient  (tj) 

nx  -  ^rfj  g/q  .  (6) 

where  q  is  the  single-cluster  grand  partition  function. 

q=  y  7*8,  ,  (7) 

and  g,  is  the  number  of  ways  of  arranging  the  alloying  atoms 
in  a  cluster  with  energy 

g,  =  n\/f.Kn  -  j)\ .  (8) 

Once  7)  is  found,  the  cluster  population  probabilities  are  ob¬ 
tained  via  the  equation 


.)  V;»r  T^rhnnl  .hil/Aito  lOflP  n?  U.7101 /««'047«>4n.n7^ni  PO 
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which  should  be  compared  with  the  random  disinbuiion 
y,’  =  g,r{l  -  jc)-  (10) 

A  variational  calculation  w  ith  respect  to  alloy  volume  is  not 
performed  because  it  has  been  shown  experimentally' "  that 
the  corresponding  change  in  alloy  volume  with  concentra¬ 
tion  follows  Vegard’s  law.  Calculations  done^  show  that  the 
deviation  of  average  bond  lengths  from  the  simple  weighted 
average  is  <0.01  A  for  all  Ill-V  and  Il-Vl  alloys  studied. 
The  free  energy  can  now  be  calculated  from  Eq.  (4),  once 
the  cluster  energies  }  are  known. 


III.  CLUSTER  ENERGIES  AND  BULK  MODULUS 

The  alloy  interactions  in  a  real  pseudobinary  cannot  be 
accurately  described  by  the  simple  fixed-value  pair  energies 
often  assumed  in  statistical  models.  This  is  particularly  true 
for  an  alloy  with  sizable  bond-length  difference  (e.g., 
Hg|  _  ,Znj,Te),  where  the  strain  energy  dominates  the  mix¬ 
ing  enthalpy.  To  write  the  A£  as  a  sum  of  cluster  energies 
{e^ }  in  the  GQCA,  each  effective  f,  must  include  all  the 
interactions  inside  the  cluster  plus  one-half  of  all  the  interac¬ 
tions  across  the  cluster.  This  can  be  achieved  by  embedding  a 
cluster  in  an  effective  alloy  medium. ■*  The  smallest  cluster  to 
use  is  a  four-bond  cluster  centered  at  the  alloying  A  and  B 
atoms.  This  choice  of  cluster,  however,  does  not  yield  infor¬ 
mation  about  the  local  correlation.  The  next  smallest  reason¬ 
able  cluster  that  considers  local  correlation  is  the  16-bond 
cluster  with  a  central  C  atom  and  four  alloying  atoms  bound 
to  the  environment  by  1 2  second-neighbor  C  atoms  (Fig.  I). 
Note  that  this  16-bond  cluster  differs  from  the  five-atom, 
four-bond  cluster  used  by  other  workers.’"''  The  16-bond 
cluster  choice  means  that  the  counting  scheme  for  the  con¬ 
figurational  entropy  given  by  Eq.  ( 3 )  is  exact,  since  no  two 
clusters  share  the  same  alloying  atoms.  Approximations  in 
the  theory  occur  in  the  cluster  energy  calculation,  where  the 
cluster-cluster  interactions  are  treated  as  a  mean-field  theo¬ 
ry 

For  the  cluster  energy  calculation,  {e^},  we  place  the  16- 
bond  cluster  in  an  alloy  environment  and  then  allow  the 
cluster  to  relax.  The  resulting  strain  energy  associated  with 
bond  length  lattice  mismatches  is  treated  with  the  valence 
force  field"’  (VFF)  as  in  Ref.  II  and  other  cluster  model 
calculations.  The  strain  energy  in  the  VFF  is  expressed  as 


Fic.  I  The  basic  microciusier 
unif  used  for  A,  .,B,C  line- 
blende  compounds.  Ii  includes  a 
central  C  atom,  four  neighboring 
A  and  B  alloying  atoms  in  the 
first  shell,  and  12  second  neigh' 
bors  to  the  central  atom  tn  the 
second  shell  (shared  with  other 
clusters) 


J  Vac,  Sci.  Technol.  A.  Vol,  6.  No.  4.  JuI/Auq  1968 


a  (Id  d,  )’ 

oa  ‘ 

^  ^  ^  •  111 

*7 

where  d  is  the  average  alloy  bond  length.  Ad,  d  is  the 
change  of  the  scalar  product  of  two  bond  vectors  due  to  dis¬ 
tortion,  and  a  and )3 are  the  two-body  radial  and  three-body 
angular  force  constants,  respectively  . 

The  excess  energies  due  to  the  change  in  chemical  envi. 
ronment  in  an  alloy  are  treated  using  Harrison's  model.’-' 
where  it  is  known  as  the  metallicity  .  For  a  pair  of  AC  and  BC 
bonds,  this  chemical  energy  is 

e,.„„(A,B)  =  Af„(A) -t- Ae,„(B)  ,  (12; 

where 


Af,„(A) 


(Qb  ■ 

5,(A)  -  e„(B) 


\(a^  H.bJ  -  \ 

fi,  ( A )  -  ( A )  / 

(13; 


and  a  similar  expression  for  Ae,,  (B).  In  Eq.  ( 13),  A  and  B 
represent  AC  and  BC  bonds,  respectively,  and  f*  and  are 
the  energies  of  the  bonding  and  antibonding  states  denoted 
by  |6  )  and  |a),  respectively. 

To  obtain  the  cluster  energies,  we  are  required  to  sohe  the 
problem  of  finding  the  equilibrium  configurations  of  the 
atoms  in  the  16-bond  cluster  embedded  in  an  infinite  alloy 
medium  with  correct  bond  lengths  and  elastic  constants ' 
While  we  have  yet  to  carry  out  the  numerical  problem  to  its 
desired  accuracy,  we  have  investigated  two  different  ap¬ 
proximations  which  simulate  the  alloy  environment.  For  the 
first  medium,  hereafter  referred  to  as  the  discrete  modii.  the 
cluster  is  placed  in  a  medium  in  which  all  atoms  on  the  third 
shell  and  beyond  are  fixed  at  their  virtual-crystal  approxima¬ 
tion  ( VCA)  positions.  It  was  shown  in  Ref  1 1  that  the  trun¬ 
cation  of  the  relaxation  medium  overestimates  the  actual 
strain.  To  counteract  this  overestimation,  the  bond-angle 
terms  in  the  VFF  across  the  cluster  are  discarded,  while 
those  within  the  cluster  are  retained.  This  same  procedure 
was  done  for  the  case  of  impurities  in  alloys  with  good  results 
for  the  bond  lengths  and  mixing  enthalpies,."  The  second 
medium,  or  elastic  continuum  model,  replaces  the  atoms 
outside  the  second  shell  by  an  elastic  medium.  We  further 
assume  that  all  atoms  in  the  elastic  medium  (i.e..  second- 
shell  atoms  and  beyond  the  second-shell  radius  R )  only  ha\  e 
radial  displacements  which  are  inversely  proportional  to  the 
square  of  the  radius.  The  strain  energy  for  the  elastic  contin¬ 
uum  is  now  given  as" 

(14 

where  u  is  the  displacement  at  R  and  C  is  an  effective  sheai 
stiffness  coefficient  given  by  C  =  1.6r-  (C,,  —  C,.  -t- 
Table  I  lists  the  values  of  the  bond  lengths  d,  the  force  con 
stants  a  and  P,  and  the  coefficients  C  for  the  three  11- VI 
systems  studied. 

In  the  calculation  of  a  particular  cluster  energy,  the  sum  oi 
the  chemical  energies  within  the  cluster  plus  the  strain  eiicr 
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T  iHi  fc  I  The  txmd  lengih\  J.  lorce  conMjms  <i  jnd  li.  jnd  eftecii'e  ^hejf 
^iielficicni',  C  for  the  ihrcc  11-V  Komp^)und^  i  For  jdeijilcddeseripiionof 
these  values,  see  Ref  1 1  ) 
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gies  of  tb  -iuster  and  the  medium  is  varied  with  respect  to 
the  posit  IS  of  the  cluster  atoms  until  a  minimum  value  f„,„ 
IS  reachi  The  cluster  energy  €„  for  the  A(4  —  n)B(d)C 
cluster  becomes  the  sum  of  plus  one-half  the  averaged 
chemical  energies  of  this  cluster  with  the  cluster  medium; 

-I-  9/21n(  1  -  jt)  -f  (4  -  . 

(15) 


(bl  « 

FiC.  2.  Hgf4  -y)Zn<  y)Tff  ciusJer  energies  per  J6-bond  microclustcr  as  a 
function  of  ZnTc  concentration  for  clusters  with  differing  numbers  of  Zn 
atoms  (a)  discrete  model  and  (b)  elastic  continuum  mode! 


The  long-range  Coulomb  interactions-  were  also  estimat¬ 
ed  and  found  to  be  very  small  compared  to  the  strain  and 
chemical  energies  Energy  corrections  that  occur  due  to 
shifts  in  d-level  energies  via  first-neighbor  p-d  interactions 
with  antibonding  states  were  also  estimated  and  found  to  be 
small  (  <0  1  kcal  moli  In  this  p-d  interaction,  it  is  sur¬ 
mised  that  the  shifts  due  to  the  cation  d  levels  counterbal¬ 
ance  the  shifts  due  to  the  anion  occupied  p  orbitals,  leaving 
the  average  energy  relatively  intact. 

For  the  calculation  of  the  bulk  modulus'*  B.  one  calculates 
the  change  of  the  average  energy  response  of  the  clusters  in  a 
compressed  medium  (with  e.xternal  pressure  5P) 

S£  =  .V/  >  .  (lb) 

which  in  turn  y  ields  the  bulk  modulus  from  the  relationship 

6E  =  {&P)-l-/[2B)  .  (!') 

where  1'  is  the  volume  of  the  alloy.  Results'*  show  the  bulk 
modulus  of  the  pseudobinary  alloy  as  a  function  of  concen¬ 
tration  bows  only  slightly  from  a  straight-line  average  (fl). 

5=fi|l  -  ltd  -x)(Afi/fl)-]  .  (18) 


Fio-  3  Hg(4  - y)Cd<  y)Te  cluster  energies  per  Ib-b^md  microcluster  as  a 
function  of  CdTe  concentration  for  clusicn  with  differing  numbers  of  Cd 
atoms,  (a)  discrete  model  and  ( b)  elastic  continuum  model 
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where  the  fractional  change  in  the  bulk  modulus  is  <  1%  for 
all  cases. 

IV.  RESULTS 

Figures  2(a)  and  2(b)  show  the  cluster  energies  per  mi- 
crocluster  of  Hg,  _  ,Zn,Te  for  microclusters  with  differing 
numbers  of  Zn  atoms.  Figure  2(a)  uses  the  discrete  model 
while  Fig.  2(b)  uses  the  elastic  continuum  model  for  the 
alloy  medium.  Note  that  while  the  energies  for  the  discrete 
model  are  almost  symmetric,  those  from  the  elastic  contin¬ 
uum  model  are  rather  asymmetric  about  the  stoichiometric 
composition  x  =  0.5.  This  is  due  to  the  fact  that  the  averaged 
a  value  varies  slowly  while  the  averaged  C  value  changes 
considerably  from  HgTetoZnTe  (seeTable  1).  Comparison 
of  the  cluster  energies  between  the  two  models  for 
Cd,  _,Zn,Te  is  similar  to  that  for  Hg,  _,Zn,Te.  The  re¬ 
sults  for  Hg|  _  ,Cd,Te,  however,  are  different”  (Figs.  3(a) 
and  3(b)  ]  from  the  other  two  alloys  in  that  the  strain  energy 
is  not  dominant.  Since  there  is  very  little  variation  of  the )? 


Fig.  4.  Deviations  of  the  Hg(4  — y)Zn(  y)Te  cluster  population  from  those 
of  a  random  alloy  as  a  function  of  ZnTe  concentration  for  clusters  with 
diffenng  numbers  of  Zn  atoms:  ( a )  discrete  model  and  ( b )  elastic  contin¬ 
uum  model. 
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values  across  the  alloy  concentration,  the  cluster  energies  are 
more  or  less  symmetnc  about  x  =05  for  both  models 
Figures  4(a)  and  4(b)  show  Hg,  ,  ^Zn^Te deviation  from 
randomness  XXj  —  x^  —  x°  for  both  alloy  models  at  1000  K 
Similar  results  were  obtained  for  Cd,  _  ,Zn,Te.  Note  thai 
while  both  discrete  and  elastic  continuum  models  predict 
very  small  deviations  from  a  random  alloy,  the  tendency  of  a 
particular  cluster  to  be  greater  or  less  than  a  random  distri¬ 
bution  differs  for  each  model.  Although  the  signs  of  A-x^  for 
Cd,_,Zn,Te  in  the  elastic  continuum  model  agree  with 
those  obtained  from  a  measurement  of  the  chemical  shifts.” 
the  magnitudes  of  the  experimental  appear  to  be  much 
larger  than  the  calculated  values.  On  the  other  hand,  the 
discrete  model  predicts  a  small  reverse  trend.  If  the  experi¬ 
ments  are  correct,  then  there  is  some  major  interaction  miss¬ 
ing  from  all  models  published  to  date.  The  cluster  popula¬ 
tions  for  Hg, .  ,Cd,Te  (discrete  model)  are  given  in  Fig  ti 


Fig.  5.  Cd, .  ,Zn,Te  alloy  average  CdTc  and  ZnTe  bond  lengths  as  a  func 
tion  of  ZnTe  concentration  The  solid  labeled  curves  are  the  average  value's 
the  dashed  curves  are  the  extent  of  the  rms  variations,  and  the  heav)  soit  l 
line  is  the  average  bond  length  corresponding  to  Vegard'srule:  (a)  dtscreie 
.lodel  and  (b)  elastic  continuum  model.  Experimental  data  from  Ref  1 5 
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FiO  6  A,  ,B,C  alloy  averagt  constituent  bond  lengths  as  a  function  of  Bconceniralion  The  solid  labeled  curves  are  the  average  values,  the  dashed  curves 
are  the  extent  of  the  rms  variations,  and  the  heavy  solid  line  is  I  he  average  bond  '.  corresponding  to  Vegard's  rule  (a)  GaAs,  ,P,.  discrete  model,  data 
from  Ref  lb.  (b)  GaAs,  .P,.  elastic  continuum  model,  data  from  Ref  16;  'cl  In,  ,vja..As.  discrete  model,  data  from  Ref  5;  (d)  In,  ,Ga,  As.  elastic 
continuum  mixlel.  data  from  Ref  $.  (e)  Zn,  ,Se,Te.  discrete  model,  data  from  Ref.  6;  and  If)  Zn,  ,Se,Te.  elastic  continuum  model,  data  from  Ref  b 
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of  Ref.  13.  [Please  note  a  correction  in  Figs.  5  and  6  of  Ref. 
13  in  that  the  cluster  labels  (Cd)  =4  and  (Cd)  =  0  in 
both  figures  are  reversed  )  While  there  is  still  a  small  devi¬ 
ation  from  randomness,  the  overall  trend  of  the  cluster  popu¬ 
lations  IS  different  than  the  discrete  results  presented  here  for 
Hg,-  ,Zn,Teand  the  experimental  results  of  Ref  14.  Since 
the  magnitudes  of  the  calculated  cluster  populations  are  very 
small  and  the  experimental  results  for  Hg,  _  ,Cd,Te  are  not 
as  simple  as  for  Hg,  _^Zn,Te.  we  have  not  attempted  to 
make  a  comparison. 

The  Cd|  .,Zn,Te  bimodal  bond-length  distributions  at 
1000  K  along  with  the  rms  deviations  and  average  values  are 
shown  in  Figs.  S(a)  and  S(b)  for  both  alloy  models.  Com¬ 
parison  of  the  two  shows  that  the  elastic  continuum  model  is 
closer  to  the  experimental  values''  than  the  discrete  model. 
Similar  bimodal  results  were  obtained  for  Hg,  .,Zn,Te, 
but  different  results  for  Hg,  _  ,Cd,Te.  in  which  the  shorter 
HgTe  bond  gets  shorter  and  the  larger  CdTe  bond  gets  long¬ 
er  in  the  alloy.  The  average  bond  lengths  for  some  111-V 
alloy  and  II-Vl  alloys  for  which  experimental  data  are  avail¬ 
able'”' are  shown  in  Figs.  6(a)-6(f).  The  only  conclu¬ 
sion  that  we  can  draw  from  these  comparisons  is  that  both 
models  gave  a  semiquantitattve  prediction  of  the  bond 
lengths.  The  data  cannot  be  used  to  disqualify  either  model. 

The  phase  diagrams  for  the  three  II-Vl  alloys  using  the 
discrete  model  are  shown  in  Figs.  7-9.  The  solid  free  energies 
were  calculated  from  the  GQCA  while  for  the  liquid  free 
energies  we  have  adopted  a  simple  regular  solution  with  a 
single  adjustable  mixing  enthalpy  parameter  (e.g.. 
A£  =  flxf  1  —  .t)  j.Thesolidus/liquidus  lines  were  then  ob¬ 
tained  by  using  the  common  tangent  method'  along  with 
Vieland's  formula  but  with  no  further  adjustable  param¬ 
eters.  '  ’  The  parameters  used  for  each  phase  diagram  calcula¬ 
tion  are  given  in  the  figure  captions.  In  theCd, .  ,Zn,Teand 
Hg,  .  ,Zn  ,Te  cases,  the  discrete  model  gave  slightly  better 
agreement  with  experimental  data'""’'  than  the  elastic  con¬ 
tinuum  model.  ForHg,  .,Cd,Te.  where  the  strain  energy  is 
small,  the  two  models  give  the  same  curves. 


Fto  1.  Hg,  .  ,Cd,Te  phase  diagram  ( discrete  model).  Values  for  ihe  liquid 
interaction  parameter,  melting  temperature,  and  heats  of  fusion  are 
It  =  -  l.OOkcal/mol,  r;;,,,  =  943  K.  =  1365  K,  =  8.727 

kcal/mol.  and  =  12.012  kcal/mol.  Experimental  data  from 

Ref.  18. 

J.  Vac.  Sci.  Technol.  A,  Vol.  6.  No.  4.  Jul/Aug  1988 

16 


Fic  8  Hg,  .Zn.Tephasediagram  (discrete model)  Values forthe liquid 

interaction  parameter,  melting  temperature,  and  heats  of  fusion  arc 
11  =  -  3  50  kcal/mol,  =  943  K.  FJ-.y,  =  1560K.AW-,,  =  8 'r 
kcal/mol.  and  a/f  j,,.  =  1 5  600  kcal/mol  Expenmental  data.  A  Ref  19, 
•  Ref  20 


V.  SUMMARY  AND  DISCUSSION 

A  generalized  quasichemical  approximation  has  been  de¬ 
veloped  to  treat  the  local  correlation  and  free  energies  for 
semiconductor  alloys.  The  two  approximate  alloy  models 
considered  produce  semiquantitative  results  for  bond 
lengths,  mixing  energies,  and  phase  diagrams.  While  both 
models  predict  small  deviation  from  randomness  for  all 
three  11-Vl  alloys  studied  here,  the  population  is  sensitive  to 
the  model  choice.  However,  the  procedure  has  not  been  car¬ 
ried  out  for  the  infinite  medium.  Future  work  will  incorpo¬ 
rate  more  exact  models  for  the  alloy  medium.  In  addition, 
better  liquid  free  energy  models  should  be  used  for  the  phase 
diagram  calculation.  However,  more  systematic,  detailed, 
and  accurate  experimental  measurements  on  these  quanti¬ 
ties  are  needed  to  provide  checks  on  the  theory.  Funher- 


Fig.  9  Cd, .  .Zn.Tephasediagram  (discrete  model).  Values  for  the  liquid 
interaction  parameter,  melting  temperature,  and  heats  of  fusion  arc 
n=  +0.160  kcal/mol.  =  >365  K,  =  1560  K.  A//*. 
=  12.012  kcal/mol.  and  A//2at<  =  15.6(X}  kcal/mol.  Experimental  dau 
from  Ref.  21. 
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more,  the  theoretical  results  (particularly  the  atomic  disin- 
butions  and  the  bond  lengths )  should  be  used  to  study  more 
properties,  e.g  ,  surface  ordenng  and  segregation,  phonon 
spectra,  deep  levels,  transpon.  nuclear  magnetic  resonance, 
etc.,  to  have  a  broader  check  and  application  of  the  theory. 
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CORRELATIONS  AND  ALLOY  PROPERTIES: 

GROWTH,  VACANCIES,  SURFACE  SEGREGATION  • 

A.  SHER  and  M.A.  HERDING 
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and 

A.-B.  CHEN  and  R.S,  PATRICK 

Physics  Department.  .Auburn  University .  .Auburn.  Alabama  36R49.  L'S-A 


The  properties  of  alios  semiconductors  are  influenced  by  the  state  cif  order  of  the  materials.  Some  properties,  such  as  the  liqaidus 
and  solidu.'i  curves,  are  relative  insensitive  but  others  are  strongly  affected  We  first  demonstrate  which  aspects  of  bonding  interaction 
mechanisms  impact  the  correlation  state.  W'e  then  deduce  the  effect  of  the  correlation  state  on  liquidus  and  solidus  curves,  on  surface 
segregation,  and  on  vacancy  densities  Surface  segregation  is  a  concentration  vanation  of  the  alloy  constituents  away  from  surfaces, 
and  IS  dnven  by  Schotlky  vacancy-free  energy  differences  between  the  constituents  in  the  bulk  and  on  a  surface  The  segregation 
profile  for  alloys  equilibrated  above  a  cnttcal  spinodal  transition  temperature  exhibits  a  monatomic  variation  decaying  over  a  few 
atomic  layers  from  the  surface  to  the  bulk  value.  However,  long-range  concentration  oscillations  are  predicted  below  the  critical 
temperature  These  oscillations  may  account  for  the  long-range  order  observed  in  some  MBE-  and  MOCVD-grown  alloys  (e  g., 
InPSb).  The  total  free  energy  of  this  new  ordered  phase  lies  between  that  of  a  homogeneous  but  highly  correlated  alloy  and  that  of  a 
spinodally  decomposed  alloy  The  pha.se.  if  it  exists,  would  be  locked  into  its  metastable  state  by  the  presence  of  the  surface 
Ri'  irtd.tn  cond''i''r.s  a:  'he  int-rfacc  hev.veen  the  epitaxial  layer  and  the  substrate  would  also  affect  the  results 


1.  Introduction 

Many  groups  have  been  .studying  the  correla¬ 
tion  state  of  tetrahedrally  coordinated  pseudo-bi¬ 
nary  semiconductor  alloys  [1-4]  of  the  form 
A,  . ,  B,C.  They  all  begin  by  decomposing  the  solid 
into  microclusters,  usually  five  atoms  consisting  of 
a  central  C  atom  and  its  four  surrounding  neigh¬ 
bors.  The  excess  energy  of  the  cluster  relative  to 
the  concentrated  weighted  average  of  similar  clus¬ 
ters  of  the  AC  and  BC  compounds  is  then  calcu¬ 
lated  in  a  variety  of  formalisms  and,  while  there 
are  similarities  in  the  gross  trends,  the  groups 
generate  rather  different  detailed  excess-energy- 
versus-concentration  curves.  These  excess  energies 

*  This  work  was  supported  by  AFOSR  Contract  F49620-88- 
K-0009,  NASA  Contrast  NASl-18226,  ONR  Contract 
N00014-88-C-0096.  and  NASA  Grants  NAG-1-708  and 
NASl-18232. 


are  then  inserted  into  statistical  mechanics  expres¬ 
sions  for  the  populations  of  the  various  microclu.s- 
ters.  and  the  excess  free  energy  is  then  calculated. 
While  all  the  groups  use  grand  canonical  ensem¬ 
bles  for  the  microclusters,  they  differ  in  their 
treatment  of  the  entropy  that  appears  in  the  free 
energy.  A  number  of  alloy  properties  are  calcu¬ 
lated.  e.g..  the  alloy  bond-length  variation  [1-8], 
the  mixing  enthalpy  [1-8],  the  critical  order-dis¬ 
ordered  transition  temperature  and  low-tempera¬ 
ture  phases  [6|.  the  cluster  populations  as  a  func¬ 
tion  of  temperature  [1-8],  and  liquidus  and  soli¬ 
dus  phase  diagrams  [9].  Despite  the  differences  in 
calculation  methods,  the  bond-length  variations 
and  mixing  enthalpies  predicted  by  the  different 
groups  are  nearly  the  same  and  agree  reasonably 
well  with  experiments  [5,10].  There  are  larger  dif¬ 
ferences  among  the  predictions  of  other  proper¬ 
ties:  unfortunately,  they  are  also  less  susceptible 
to  direct  experimental  measurement. 
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In  this  paper  we  first  identify  the  reasons  for  a 
lack  of  sensitivity  to  some  details  of  the  theory, 
and  which  aspects  of  the  theory  must  be  correct  to 
make  accurate  correlation-state  predictions.  We 
then  display  some  solidus  and  liquidus  curves  and 
e.vamine  the  sensitivity  of  distnbution  coefficients 
to  features  of  the  theory.  Next,  we  present  vacancy 
energies  for  several  alloys  and  draw  tentative  con¬ 
clusions  about  their  effects  on  diffusion,  doping, 
and  ion  beam  processing.  Finally,  we  present  fea¬ 
tures  of  the  theory  of  surface  segregation  in  semi¬ 
conductor  alloys. 


2.  Correlations 

2.1.  General  theory 

We  and  others  [2-41  have  demonstrated  that 
there  are  always  correlations  in  alloy  semiconduc¬ 
tors.  Here  we  summarize  our  previous  >vork.  locus- 
ing  attention  on  those  aspects  that  may  help  us  to 
identify  what  may  be  missing  from  the  interac¬ 
tions  dnving  correlations  and  those  features  to 
which  correlations  are  insensitive.  We  consider  an 
average  population  of  five-atom  clusters  includ¬ 
ing  sixteen  bonds  (most  others  treat  four  bonds 
only).  Five-atom  clusters  of  the  form  Aj.,,  B„C 
can  he  shown  to  be  distributed  in  grand  canonical 
ensembles  of  the  form 

=  -  (1) 

where  the  partition  function  is 

I  (2) 

and  is  the  degeneracy  of  cluster  j  =  0. 

1 . 7(4  <J  <2* ).  is  the  number  of  B  atoms 

on  cluster  j.  is  the  excess  energy  of  cluster  j 
relative  to  the  average  e  =  (1  -  x  )«^  +  .xcb  (called 
by  Sher  et  al.  [1]).  and  tg  are  cluster 
energies  of  the  pure  AC  and  BC  compounds,  k  is 
the  Boltzmann  constant,  and  T  is  the  absolute 
temperature. 

If  the  clusters  do  not  have  their  normal  count¬ 
ing  degeneracies  g”  =  (^  )  split,  then  we  have  g^  = 


g?.  the  binomial  coefficient,  and  7  =  4.  The  chem¬ 
ical  potential  is  set  by  insisting  that  the  composi¬ 
tion  X  is  correct. 

4.V  =  (3) 

J 

If  we  make  a  transformation  to  the  reduced 
excess-energies  representation. 

=  (l  -  (4) 

(called  A\  by  Sher  et  al.  [1]).  then  x^  becomes 

-v,  =  K,  m'b)  (5) 

where 

Mb  =  Mb  ~  “  ‘o)-  (6) 

Note  that  we  have  A„  =  Ji^  =  0.  so  at  least  two 
members  of  the  set  { A^ }  vanish.  More  important, 
only  the  reduced  excess  energies  actually  drive  the 
cluster  populations.  We  shall  see  shortly  that  the 
energies  of  the  { A^ }  set  are  often  five  to  ten  times 
smaller  than  those  of  the  { )  set.  Most  of  the 
strain  contribution  to  the  {c,}  set  subtracts  out 
and  contributes  little  to  the  x^  values. 

There  is  an  additional  effect.  It  can  best  be 
appreciated  when  stated  as  a  theorem.  If  (a)  A,  = 
-i„  -i-  n^8.  and  if  (b)  g,  =  (^  ).  then  7  =  4  and 

.x^  =  .x;  =  g;(i (7) 

the  populations  of  a  random  alloy.  Thus,  no 
matter  how  large  the  constant  6  may  be.  as  long 
as  the  energies  A^  vary  linearly  with  the  number  of 
B  atoms  on  the  cluster  ttj  and  th-.  counting  degen¬ 
eracy  is  not  split,  the  alloy  is  random.  In  fact 
6  =  0.  since  we  have  ily  =  =  0.  Strain  energies 

from  bond-length  differences  between  the  con¬ 
stituents.  chemical  energies,  and  electron-electron 
interactions  all  have  some  nonlinear  variations 
with  n^.  The  degeneracy  g°  is  split  by  coherent 
strains  or  temperature  gradients,  anything  that 
establishes  preferred  directions  for  the  locations  of 
A  and  B  atoms  on  a  cluster.  These  splittings 
always  establish  a  preference  for  one  type  of  clus¬ 
ter  in  a  particular  orientation  (e.g..  for  a  strain  in 
the  (111)  direction,  A3B  with  the  B  oriented  along 
the  (111)  axis),  and  therefore  drive  deviations  of 
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the  {.v^}  set  from  {.v'^j  toward  compound  forma¬ 
tion.  and  if  the  splittings  are  large  enough,  even 
toward  long-range  order.  The  effect  of  the  en¬ 
ergies  {J^}  can  dnve  the  ( }  set  toward  either 
compound  formation  or  spinodal  decomposition, 
depending  on  the  relative  bond  length  and  bond 
energy. 

The  excess  free  energy  of  an  .A; .  ,B,C  alloy  can 
then  be  written  as 


1F  =  1E-T1S.  (S) 

w  here  the  excess  energy  is 

(9) 

J 

with  M  being  the  number  of  clusters.  The  excess 
entropy  is 


k  In 


I  .V! 


■'a-  ^'h- 


j 


n 


.V/! 


X 


(10) 


with  .V/  =  A'^.V/  (see  Sher  et  al.  [1]  for  the  justifica¬ 
tion).  The  total  number  of  clusters.  .V/,  is  related 
to  the  number  of  Bravais  lattice  sites,  in  the 
crystal  by  M=\\  for  a  16-bond  cluster  and 
A'  =  ,V  for  a  4-bond  cluster,  Eqs.  (9)  and  (10)  can 
be  rewritten  for  the  16-bond  cluster  as 


AE=  JA’ 


(1  -  .V)€„  -I- A€y 


(11) 


and  in  general  the  excess  entropy  is 


AS  =  -ky 


+  ~'L(x,  In  In  x°) 


( 1  -  A )  ln(  1  -  A )  -I-  A  In  A 


fin 


(i:) 


Note  in  the  first  equality  in  eq.  (12)  that  M/N  is 
unity  for  a  4-bond  cluster  and  1  /4  for  a  cluster 
with  16  bonds.  In  the  16-bond  cluster,  after  sub¬ 


stitutions  from  eqs.  (5 1  and  (7),  the  (l-.v)  Ini  1  - 
-X )  -I-  .V  In  A  term  is  exactly  canceled  by  a  portion 
of  the  third  term  in  the  brackets.  This  occurs 
physically  because,  in  the  16-bond  cluster,  all  the 
bonds  associated  with  a  given  substituted  atoms 
are  in  the  cluster;  in  the  4-bond  case,  however, 
bonds  from  each  substituted  atom  contnbute  to  4 
different  clusters.  Several  features  of  eqs.  (11)  and 
(12)  are  noteworthy.  First,  x,,  and  tj  are  functions 
of  A  but  in  lowest  order  are  temperature  indepen¬ 
dent.  For  bond-length-mismatched  alloys.  the\ 
vary  with  a  roughly  as  =  ell.v'  and  tj  =  Eji  1  - 
.v)‘.  so  that  we  have 


AE  =  .V 


.Qa(  1  -  a  )  -I- 


(  13) 


where  the  mixing  enthalpy  parameter,  f?.  is  ap¬ 
proximately 


and  nearly  independent  of  .v  if  =  tf.  as  is  often 
the  case.  The  second  term  in  eq.  (13)  is  usually 
small  compared  with  the  first  and  contains  all  the 
temperature  dependence.  However,  when  eqs. 
(12).  and  (13)  are  inserted  into  eq.  (8)  to  obtain 
AF.  a  term  m  TAS  exactly  cancels  the  second  term 
in  eq.  (13).  Thus,  it  is  impossible  to  determine  the 
temperature  variation  of  J£  from  a  measurement 
of  AF.  The  expression  for  AF  becomes 


df  =  .v[i2A(l  -  a)  +  .tju; 

lnt7(|-i,}.  Mb)1-  (I?) 

The  chemical  potential  is  determined  from  eq. 
(3)  and  both  jUg  and  q  are  generally  temperature 
dependent. 

In  a  much  different  formalism  than  ours  [1] 
and  under  more  highly  constrained  conditions. 
Ferreira  et  al.  [4]  later  reached  some  of  the  same 
conclusions  presented  in  this  section.  The  result 
that  the  {.x^}  set  depends  only  on  {A,},  and  that 
because  most  of  the  strain  contribu¬ 
tions  cancel  in  the  set,  is  a  direct  conse¬ 

quence  in  our  approach  of  the  concentration  con¬ 
servation  constraint,  eq.  (3).  No  assumptions  r.‘- 
stricting  A^  to  be  a  independent  are  made  in  our 
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(ic^elop^1ent.  and  in  faci  we  find  J  does  have  a 
slow  variation  with  a  in  the  examples  presented  in 
the  next  section. 

2.  d.  Sumerical  results 

We  have  calculated  { (.],  [  J ,  J.  Mb-  I  }• 

A  til  T I.  and  AF{T)  for  a  number  of  allocs;  fig.  1 
show s  those  for  Ga ,  ,  In  ,  .As.  The  energies  e ,  were 
calculated  as  discussed  in  detail  in  ref.  [1|.  The 
sixteen-bond  cluster  was  attached  to  rigid  medium 
at  the  virtual  crvstal  positions  in  the  third  shell 
from  the  cluster  center.  The  atom  positions  inside 
the  third  shell  were  adjusted  to  minimize  the  strain 
and  chemical  energies.  The  energs  <,  is  the  total 
energs  of  the  sixteen  bonds  with  the  atoms  in  their 
minimum  energx  positions.  The  variations  of  atom 
bond  lengths  in  different  allocs  are  well  predicted 
be  this  method. 

The  variation  of  ;e  ;  with  a,  and  that  of  j  ) 
are  quite  different.  As  is  evident  from  the  curves, 
the  ;  J  j  set  an  order  of  magnitude  smaller  than 
the  i  <  [  values,  and  most  of  the  curvature  identi¬ 
fied  with  the  strain  contribution  has  subtracted 
out.  The  excess  enthalpc  has  onlc  a  small 
temperature  dependence,  as  we  anticipated.  The 
excess  free  energy  J/-'  at  6Ui)  K  has  a  negative 
curvature  at  v  =  t.  which  indicates  that  the  criti¬ 
cal  transition  temperature  is  below  60(J  K. 

The  reasons  why  bond  lengths  and  mixing  en¬ 
thalpies  agree  cvith  experiments  in  a  variety  of 
quite  different  cluster  model  approximations  are 
discussed  exten.siveic  in  refs.  [2.8.9|  The  underly¬ 
ing  cause  for  the  insensitivity  of  bond  lengths  is 
that  if  the  region  outside  the  cluster  is  allowed  to 
relax  in  the  energy-minimization  procedure  u.sed 
to  identify  the  bond  lengths,  the  strain  from  the 
angle-restonng  forces  tends  to  be  canceled.  Thus, 
models  that  include  angle-restonng  forces  and 
some  outside  relaxation  or  that  ignore  angle-re¬ 
storing  forces  and  outside  relaxation  both  predict 
the  bond  lengths  correctly.  However,  the  two 
classes  of  models  predict  different  sets.  The 
set  based  on  a  stiff  exterior  (no  relaxation)  is 
larger  with  sharper  x  vanations.  However,  as  we 
have  .seen,  most  of  this  strain  contribution  sub¬ 
tracts  from  the  {.A,}  set.  Thus  the  populations 
{  V  1  tire  not  as  different  in  the  two  types  of 


approach  as  might  have  been  expected.  The  excess 
entropy  AE  shi'uld  differ  However,  the  various 
groups  also  include  chemical  energies,  and  those 
Using  stiff-extenor  approximations  often  report 
larger  negative  chemical  terms  so  mi.xing  enthalpv 
parameters  agree  with  experiment  There  is  no 
general  theory  in  disordered  alloys  of  how  to 
calculate  cluster  excess  energies,  for  which  the 
small  parameter  of  the  theorv  is  defined.  While  a 
start  was  made  on  this  theory  [11].  a  more  com¬ 
plete  version  is  needed  to  make  convincing  pro¬ 
gress. 

We  have  also  show  n  that  onlv  the  small  { J  ' 
set  dnves  deviations  in  the  populations  .v  awav 
from  randomness  Moreover,  if  A  =  8n  is  linear 
in  n  the  number  of  B  atoms,  then  despite  the  size 
of  S.  the  populations  are  still  random.  Thus,  ef¬ 
fects  that  would  otherwise  be  considered  small 
may  compete  with  the  larger  energies  retained  if 
they  have  the  proper  nonlinearitv .  \^'e  are  examin¬ 
ing  several  possibilities  not  incorporated  m  our 
present  results.  These  include  effects  caused  bv 
screening  in  the  composition  range  where  narrow - 
gap  alloys  are  semimetal.  direct  second-neighbor 
chemical  interactions  treated  in  the  context  of 
Harrison's  model,  electron-electrim  Coulomb  in¬ 
teractions  driven  by  polarity  differences  between 
the  constituents,  and  compi'sinon  variations  of  the 
elastic  constants. 


3.  Solidus  and  liquidus  phase  diagrams 

We  have  reported  on  high-temperature  phase 
diagrams  of  a  number  of  alloys  [11];  two  examples 
are  presented  m  figs.  2  and  3.  The  solidus  and 
liquidus  curves,  figs.  2a  and  2b.  were  calculated 
using  our  solid  free  energy,  following  the  logic  of 
Stnngfellow  and  Green's  [12]  modification  of  V'le- 
land's  [13]  theory  and  their  liquid  free  energy.  As 
can  be  seen,  the  agreement  with  expenment  is 
excellent  for  GalnP,  but  that  with  Galn.As  is 
poorer.  To  demonstrate  the  sensitivity  of  these 
curves  to  aspects  of  the  theory,  figs.  2b  and  3b 
show  the  distribution  coefficients  (the  ratio  of  the 
.solidus  to  liquidus  concentration  at  a  given  tem¬ 
perature)  plotted  against  the  resulting  solid  con¬ 
centration  for  the  correlated  model  and  for  an 
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ideal  random  alloy  using  the  same  enthalpy 
parameters.  As  can  be  seen,  a  regular-solution 
random  model  differs  substantially  from  the  corre- 
lated-alloy  case. 


4.  Vacancies 

Vacancy  formation  energies  have  been  calcu¬ 
lated  for  semiconductors  and  their  alloys  that  take 
explicit  account  of  the  final  state  to  which  the 
removed  atom  goes  as  well  as  bond-lengths  relaxa¬ 


tion.  charge  redistribution,  and  dangling-h\ bnd 
interaction  modificaiioni>  to  the  lattice  surround¬ 
ing  the  vacated  site  [14.15].  The  model  is  based  on 
a  tight-binding  Hamiltonian  [16j  with  unisersai 
scaling  that  yields  correct  cohesive  energies  The 
extraction  energies  Ejoz)  where  the  final  stale  is 
an  atom  at  infinity,  vacancy  energies  £jlll>  and 
£,(100).  where  the  atom  final  state  is  a  (111)  or 
( 100)  surface  site,  and  sublimation  energies  £,(  111) 
from  a  (111)  surface  are  in  table  1.  Notice  that 
£,(111)  is  not  much  different  for  the  anions  and 
cations  of  a  given  compound.  To  reach  the  ap- 
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solution  model. 


23 


••t  Sher  el  al  Correiauom  and  atloy  properties 


Table  1 

\aeanc>  formation  energies  for  removal  of  an  atom  from  bulk  to  rnfiniis.  fjoc).  to  a  inp!>  bonding  site  on  a  (111)  surface. 
£\  (11 1 1,  and  to  a  doubl>  bonding  site  on  a  (ICX))  surface,  £.(100).  and  the  sublimation  energy  from  a  tnply  bonding  site  on  the  ( 1 1 1 1 
surface.  £.(111);  all  energies  are  in  eV 


Elemeni 
or  compound 

£.(  X ) 

£,(111) 

£,(111) 

£.(100)  ■*' 

Canon 

Amon 

Canon 

Amon 

Cation 

Amon 

Cation 

.Amon 

Carbon 

19.25 

19.25 

5.93 

593 

13.32 

13  32 

11.89 

11  89 

Silicon 

10,32 

10.32 

2.82 

2.82 

7.50 

7.50 

5.68 

5.68 

Germanium 

8.11 

8.11 

2.10 

2,10 

6.01 

6.01 

4.23 

4.23 

Tin 

6.30 

6.30 

1.59 

1  59 

4.71 

4.71 

3.18 

3.18 

AlP 

10.65 

11.12 

3.55 

4.11 

7.10 

7.01 

6.62 

6.62 

GaP 

8  30 

8.76 

2.62 

2.83 

5.68 

5.93 

5.92 

4.02 

InP 

8.23 

8.83 

2.60 

3.52 

5  63 

5  31 

5.40 

4.70 

AlA.s 

8. "9 

9.65 

4.74 

5.10 

4.05 

4.55 

4.85 

6.03 

GaAs 

6  66 

7.81 

3  66 

3.76 

3.00 

4.05 

4.20 

3.75 

In.Ab 

6.78 

7.48 

3.56 

4.14 

3.22 

3.34 

3.94 

4.12 

AlSb 

4.70 

6.06 

2.72 

2.73 

1.98 

3.33 

2.61’ 

3.29 

GaSb 

5.95 

6.82 

3.19 

3.13 

2.76 

3.69 

4.32 

2.53 

InSb 

5.97 

6.28 

3.10 

3.16 

2.87 

3.1 

3.96 

2.69 

2nS 

6.90 

9.99 

4.21 

4.19 

2.69 

5.80 

5.00 

5.53 

CdS 

6.09 

8.'8 

3.72 

3.72 

2.37 

5.06 

4.17 

5.02 

HgS 

4.53 

7.78 

3.08 

3.08 

1.45 

4.''0 

4.35 

3.88 

ZnSe 

5.66 

8.66 

3.57 

3.55 

2.09 

5.11 

4.18 

4  98 

CdSe 

5.03 

7.76 

3.20 

3.20 

1.83 

4.56 

3.35 

4.60 

HgSe 

3.39 

6.77 

2.49 

2.52 

0.90 

4.25 

3.37 

3.39 

ZnTe 

5.10 

8.08 

2.09 

2.11 

3.01 

5.97 

4.23 

4.15 

CdTe 

4.68 

7.36 

1.90 

2.05 

2.78 

5.31 

3.65 

3.99 

HgTe 

3.10 

6.36 

1.50 

1.51 

1.60 

4.85 

3.57 

2.61 

■“  £.i'i00i  IS  an  estimate  only  and  does  not  include  back-bond  relaxation  at  the  surface. 


propnate  (111)  surface,  an  atom  breaks  four  bonds 
in  the  bulk  and  remakes  three  bonds  on  the  surface. 
The  differences  between  anions  and  cations  arise 
from  differences  they  cause  on  back-bonding  states 
adjacent  to  the  vacant  site.  The  sublimation  en¬ 
ergies  are  another  matter,  since  the  promotion 
energies  are  quite  different  for  anions  and  cations. 

Following  the  systematic  variations  of  extrac¬ 
tion  energies  is  a  convenient  way  to  examine  trends 
in  the  alloys.  The  variations  of  £.(00)  for  differ¬ 
ent  clusters  are  shown  for  the  common  arsenic 
and  common  gallium  alloys,  respectively,  in  figs.  4 
and  5.  for  .c  =  0.5.  The  variation  of  the  numbers 
with  X  is  fairly  small.  There  are  a  several  note¬ 
worthy  features  to  these  curves  that  will  signifi¬ 
cantly  affect  diffusion,  doping,  and  other  device 
processing  steps.  First,  as  would  be  expected,  vari¬ 


ation  with  cluster  type  is  more  prominent  for  the 
unalloyed  atom  type,  which  is  affected  by  the 
occupation  of  the  four  near-neighbor  sites;  the 
alloyed  atom  character  depends  on  the  occupancy 
of  the  twelve  second-neighbor  sites.  The  variations 
all  tend  to  bow  down,  some  by  large  amounts. 
This  means  that  vacancy  concentrations  in  alloys 
will  differ  greatly  from  those  that  would  be  calcu¬ 
lated  from  an  average  activation  energy. 

The  hop  process  will  also  be  affected.  For 
example,  in  GaP, _j,As^.  it  will  cost  more  energy 
for  a  Ga  on  a  PASj  cluster  to  hop  to  either  a 
PvAs,  or  an  As  4  cluster  than  to  hop  to  another 
PASj  cluster.  Thus,  for  high  x  (the  arsenic-rich 
side)  where  the  PAsj  cluster  population  Xy  is 
large,  we  might  expect  diffusion  profiles  character¬ 
istic  of  parallel  paths  with  different  activation 
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Fig.  4  Extraction  energies  of  the  arsenic  common  anion  alloys  as  a  function  of  neighbonng  configurations  for  a  concentration 

•r  =  0.5, 


energies.  This  will  occur  if  .Vj  exceeds  the  percola¬ 
tion  threshold  [17],  Clearly  these  extraction  energy 
curves  also  suggest  that  nonisoelectronic  impurity 
substitution  energies  will  vary  for  the  different 
clusters.  Moreover,  ion  milling  and  reactive  ion 
etching  may  be  quite  different.  For  example,  it  has 
been  observed  [18]  that  in  the  presence  of  a  gal¬ 
lium  ion  beam,  InAs  etches  an  order  of  magnitude 
more  rapidly  in  a  chlorine  gas  environment.  This 
might  be  anticipated  from  fig.  3.  since  the  indium 
extraction  energy  from  a  Gain,,  cluster  is  smaller 
than  that  from  an  In,;  cluster.  It  remains  to  be 
demonstrated  that  this  effect,  rather  than  some 
damage  mechanism,  is  responsible  for  the  in¬ 
creased  etch  rate.  Many  other  effects  are  implied 
by  figs.  4  and  5  as  well  as  similar  results  for  all  the 
other  III-V  and  II-VI  alloys.  These  kinds  of 


considerations  obviously  will  play  a  role  in  any 
device  processing  design  rules  applied  to  alloys. 

5.  Surface  segregation 

It  is  well  established  in  metal  alloys  that  dif¬ 
ferences  between  the  vacancy  formation  energies 
of  the  constituents  drive  concentration  variations 
near  the  surface  [19],  but  this  effect  has  not  been 
treated  previously  in  semiconductor  alloys.  We 
have  begun  this  theory  by  treating  the  solid  as  a 
succession  of  layers  parallel  to  the  surface  [19]. 
The  interaction  energies  of  atoms  within  a  layer 
and  between  layers  are  all  included  in  the  average 
enthalpy.  The  configuration  entropy  of  each  layer 
is  calculated  from  the  appropriate  version  of  eq. 
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Fig.  5  Exiraction  energjes  of  the  gallium  common  cation  alloys  as  a  function  of  neighboring  configurations  for  a  concentration 

X  =  0.5. 


(10)  for  pair  interactions  and  for  each  layer  i,  x  is 
replaced  by  the  layer  concentration  x,.  The  con¬ 
centration  of  each  layer  is  then  set  by  the  condi¬ 
tion  that  the  chemical  potential  be  constant. 

We  find  several  startling  results.  If  the  solid  has 
equilibrated  at  a  temperature  T>  T^.  the  critical 
temperature  for  spinodal  decomposition,  then  the 
surface  concentration  returns  to  the  bulk  value  in 
a  few  atomic  layers  However,  if  the  equilibration 
temperature  is  below  7^.,  then  the  range  of  the 
segregation  profile  is  much  longer  and  the  con¬ 
centration  will  oscillate  with  a  period  and  ampli¬ 
tude  that  depend  on  the  equilibration  temperature 
and  the  average  composition.  The  oscillations  can 
have  a  large  modulation  index  and  periods  of  4 
and  20  lattice  spacings  have  been  found.  This  is  a 
superlattice  phase  that  is  stabilized  by  surface 
segregation  and  an  order-disorder  transition.  We 
are  currently  examining  the  influence  of  a 


boundary  condition  corresponding  to  an  epitaxial 
layer  on  a  substrate. 

It  is  too  early  in  our  calculations  to  determine 
whether  surface  segregation  is  responsible  for  any 
of  the  long-range  order  observed  in  several  alloys. 
However,  the  effect  is  strong,  being  driven  by  the 
same  mechanisms  responsible  for  high-tempera- 
ture  order-disorder  transitions,  and  requires  rela¬ 
tively  little  diffusion  to  establish  the  relevant  con¬ 
centration  patterns,  so  we  anticipate  that  it  will 
result  in  observable  phenomena. 

6.  Concluding  remarks 

There  are  two  separate  aspects  to  the  correla¬ 
tion  state  of  an  alloy.  The  first  is  specified  by  the 
population  distribution  of  the  different  types  of 
clusters,  taking  the  alloy  to  be  spatially  homoge- 
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neous.  The  second  is  ihe  way  the  alloy  separates 
below  the  critical  order-disorder  transition  tem¬ 
perature  into  a  spatially  inhomogeneous  arrange¬ 
ment.  The  cluster  population  distributions  are 
driven  only  by  small  subtle  aspects  of  the  lattice- 
mismatch-induced  strain  and  chemical  terms,  the 
reduced  excess  energies, while  the  mixing  enthalpy 
parameter  that  determines  the  order-disorder 
transition  temperature  depends  on  the  total  strain 
and  chemical  contributions  to  the  cluster  excess 
energie.s.  Because  only  the  small  reduced  excess 
energies  {d  )  enter  into  the  populations  (.x^ },  the 
theory  may  have  to  take  account  of  small  effects 
not  previously  thought  to  be  significant  if  accurate 
predictions  are  to  be  made. 

Vacancy  formation  energies  are  sensitive  to  the 
loccl  surroundings  from  which  an  atom  is 
extracted,  as  well  as  the  final  state  of  the  atom, 
e  g.,  the  particular  surface  orientation  and  its  local 
arrangement.  From  the  systematic  variation  of  the 
extraction  energies  alone,  it  is  evident  that  multi¬ 
ple  activation  energy,  and  parallel  path  diffusion 
should  be  expected  in  alloys.  Moreover,  noniso- 
electronic  impurities  are  likely  to  have  preferred 
cluster  sites  as  well,  although  we  have  done  no 
explicit  calculations  of  dopant  site  preferences. 

The  vacancy  formation  energy  differences  be¬ 
tween  alloy-substituted  atoms  were  also  shown  to 
drive  surface  segregation.  For  examples  whose  ef¬ 
fective  equilibration  temperature  is  above  the  criti¬ 
cal  order-disorder  transition  temperature,  the 
surface  will  be  rich  in  whichever  component  pre¬ 
fers  to  go  to  the  surface  and  the  composition 
returns  to  the  bulk  value  in  a  few  lattice  spacings. 
However,  for  samples  equilibrated  below  the  criti¬ 
cal  temperature,  our  preliminary  results  indicate 
that  compositional  oscillations  may  occur,  produc¬ 
ing  a  superlattice.  Details  remain  to  be  established 
for  various  materials  systems.  This  arrangement 
corresponds  to  a  new  type  of  metastable  phase 
that  is  dnven  by  a  surface  boundary  condition. 
The  free  energy  of  this  phase  lies  between  that  of 
a  homogeneous  alloy  and  its  spinodal  decomposed 
arrangement.  Whether  or  not  this  new  phase  cor¬ 
responds  to  any  of  the  correlated  arrangements 
[20.21]  that  have  been  observed  in  alloys  grown  at 
low  temperature  remains  to  be  established. 
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A  Green's-function  method  has  been  used  to  study  the  surface  and  interface  electronic  structures 
of  the  II- VI  compounds  HgTe  and  CdTe.  Localized  surface  and  resonance  states  near  the  cation- 
terminated  I  lOOi  surface  of  CdTe  and  the  anion-terminated  surface  of  HgTe  have  been  found  for  the 
ideal  surfaces.  The  energies  and  strengths  of  these  surface  states  are  altered  by  surface  perturba¬ 
tions.  The  bulk  states  near  the  surface  are  drastically  modified  by  the  creation  of  the  surface,  but 
the  band  gaps  remain  unchanged.  Numerical  evaluation  of  the  local  densities  of  states  at  the  T  and 
J  points  shows  that,  at  the  llOOi  interface  of  HgTe/CdTe.  the  previously  observed  surface  states  are 
no  longer  present.  However,  in  the  interface  region,  bulk  states  of  one  material  penetrate  some  dis¬ 
tance  into  the  other  material. 


I.  INTRODUCTION 

One  of  the  earliest  formulations  used  in  the  investiga¬ 
tion  of  surfaces  and  interfaces  was  based  on  the  linear 
combination  of  atomic  orbitals  (LCAO)  method. 
Goodwin'  first  applied  the  LCAO  model  to  study  the  ex¬ 
istence  conditions  for  localized  Tamm’  states  in  a  crystal. 
He  found  that  these  states  occur  when  the  diagonal 
Coulomb  integrals  and  the  off-diagonal  resonance  in¬ 
tegrals  of  the  surface  are  allowed  to  be  different  from 
those  of  the  bulk,  for  systems  of  single-level  or  of  sp- 
hybridized  atoms.  At  the  same  time,  Shockley^  investi¬ 
gated  a  periodic  potential  that  is  terminated  at  its  max¬ 
imum  and  found  that,  under  the  condition  that  the  bulk 
bands  crossed,  surface  states  exist  in  the  middle  of  the 
band  gap.  Shockley  states,  as  they  have  come  to  be 
known,  come  into  being  when  the  perturbations  at  the 
surface  are  small  in  comparison  to  the  widths  of  the  al¬ 
lowed  energy  bands.  Koutecky'*  and  others’  generalized 
Goodwin's  model  to  study  the  energy  and  existence  con¬ 
ditions  of  surface  states.  Electronic  interface  states  were 
studied  within  the  LC.AO  method  by  Davison  and 
Cheng."  An  exact  tight-binding  solution  for  the  surface 
and  interface  problems  of  a  one-dimensional  'eiiii.'-onduc- 
tor  was  obtained  by  Dy  and  co-workers.’  * 

Since  these  early  investigations,  many  other  papers 
have  appeared  in  the  literature  for  the  study  of  surface 
and  interface  properties  of  solids.  Kalkstein  and  Soven 
<KS)  (Ref.  9)  introduced  a  Green’s-function  (GF)  formal¬ 
ism  to  siudy  the  surface  electronic  properties  of  semi¬ 
infinite  crystals.  This  is  a  relatively  simple  but  powerful 
method  by  which  both  the  bulk  and  surface  properties  of 
a  semi-infinite  crystal  can  be  studied.  The  method  can  be 
generalized  in  a  straightforward  manner  to  study  the 
electronic  properties  of  an  interface  formed  by  joining 
two  semi-infinite  crystals.  Because  of  its  simplicity  and 
power  the  method  of  KS  was  widely  applied  in  many  cal¬ 
culations  during  the  decade  following  its  develop¬ 


ment.'®'  '■*  For  the  same  reasons  we  apply  this  method  in 
this  work  to  investigate  surface  and  interface  electronic 
structures  of  the  pure  ll-Vl  compounds  CdTe  and 
HgTe.”'" 

Despite  the  intense  interest  m  Hg,_,Cd,Te  over  the 
past  three  decades  due  to  its  application  in  the  prepara¬ 
tion  of  infrared  detectors,  there  have  been  relatively  few 
experimental  and  theoretical  investigations  of  the  surface 
and  interface  properties  of  this  system.'’' Since  the 
KS  method  is  well  suited  for  the  study  of  the  surface 
properties  of  a  system  described  by  a  tight-binding  Ham¬ 
iltonian,  we  use  here  a  Hamiltonian  closely  related  to  that 
obtained  by  Hass,  Ehrenreich,  and  Velicky  (HEV)  (Ref 
20)  for  HgTe  and  CdTe  in  the  empirical  tight-binding  ap¬ 
proximation  (ETBA)  based  on  the  LCAO  interpolation 
scheme  of  Slater  and  Koster.’’  In  the  ETBA  the  predict¬ 
ed  band  structures  of  the  pure  compounds  HgTe  and 
CdTe  are  matched  to  experimentally  determined  band  en¬ 
ergies." 

In  Sec.  II,  we  introduce  the  formalism  of  KS  for  the 
description  of  the  surface  properties  of  Il-VI  materials 
along  with  the  extension  of  the  technique  to  the  problem 
of  interfaces  of  these  materials.  Section  III  contains  the 
results  of  our  calculation  and  a  discussion. 

II.  FOR.MALISM 

In  the  formalism  of  Kalkstein  and  Soven,"  a  pair  of 
semi-infinite  crystals  is  formed  by  introducing  a  cleavage 
plane  into  an  infinite  crystal  in  one  crystallographic 
direction.  A  GF  describing  the  electronic  properties  of 
the  semi-infinite  systems  is  derived  from  the  GF  of  the 
infinite  crystal  and  a  scattering  potential  representing  the 
cleavage.  When  combined  with  a  tight-binding  formal¬ 
ism  in  which  nearest-  and  next-nearest-ncighbor  interac¬ 
tions  are  included,  the  scattering  potential  is  relatively 
easy  to  calculate  making  application  to  realistic  systems 
simple.  The  Hamiltonian  is  constructed,  following  HEV, 
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for  HgTc  and  CdTe  using  jp  '  basis  states  and  including 
spin  "'  ■'  The  parameters  used  are  thosr  of  Slater  and 
Koster*'  as  determined  b>  HE\'  (Ref.  20  e.xcept  that,  m 
our  case,  the  values  of  Esx  '  1 10  •  and  Esx  lOl  1  >  are  inter¬ 
changed.  This  produces  band  structures'*  '®  that  are 
qualitati\el>  similar  to  those  of  Bryant  which  offer  im¬ 
proved  conduction-band  mass.'*  We  calculate  the  GF,  G 
associated  with  this  Hamiltonian  by  using  the  defining 
equation 

G  =^E  '  .  ll) 

where  H  is  the  Hamiltonian  of  the  infinite  unperturbed 
system.  It  is  understood  that  G  is  to  be  calculated  in  the 
limit  as  the  positive  imaginary  part  k  approaches  zero. 
These  calculations  are  performed  in  k  space,  utilizing  ful¬ 
ly  the  periodicity  of  the  crystal. 

For  the  surface  iinterfacei  calculations,  since  we  no 
longer  have  translational  periodicity  along  the  direction 
perpendicular  to  the  surface  (interface),  we  cannot  use  an 
ordinary  A:-space  representation.  Kalkstein  and  Soven'* 
assume  periodic  structure  parallel  to  the  surface  and  use 
a  representation  consisting  of  states  which  are  localized 
on  planes  of  atomic  sites  parallel  to  the  surface  and 
represented  by  the  inde.x  n,  and  of  Bloch  states  reflecting 
the  periodic  symmetry  within  the  planes  which  are 
represented  by  the  index  k  .  This  s  the  mixed  or  Bloch- 
Wannier  representation  We  assume  the  same  type  of 
symmetry  in  the  interface  system.  The  Hamiltonian  and 
the  GF  of  the  bulk  crystal  as  well  as  the  Hamiltonians  of 
the  semi-infinite  and  interface  systems  must  all  be  ex¬ 
pressed  in  this  mixed  representation.  The  formulas  for 
the  semi-infinite  system  were  derived  by  KS  and  are  the 
same  for  us  if  we  reinterpret  the  algebraic  expressions  as 
matrix  equations  for  the  sp  '-spin  basis  states.  Note  that 
in  this  paper  we  examine  the  surfaces  and  interfaces  per¬ 
pendicular  to  the  [100]  direction  for  pure  CdTe  and 
HgTe  samples.  For  these  compounds,  this  structure  cor¬ 
responds  to  an  arrangement  in  which  the  anions  and  cat¬ 
ions  are  placed  in  alternating  planes  parallel  to  the  sur¬ 
face  or  interface.  For  notational  simplicity,  in  the  follow¬ 
ing,  a  cation-anion  pair  of  planes  is  given  a  single  layer 
index  n.  with  the  species  index  v  left  to  distinguish  be¬ 
tween  the  two  species  (layers).'*  '® 

For  basis  states  of  the  infinite  system,  KS  used  states 
analogous  to  aavk),  where  a=s,x,y,z  indicates  the 
atomic  orbital,  the  spin  ( T,  1'  is  represented  by  a .  v  =  a  or 
c  stands  for  anions  or  cations,  respectively,  and  k  is  the 
wave  vector.  Following  KS,  we  write  the  Bloch-Wannier 
states  for  our  system  as 


where  R,,  is  the  position  of  the  ion  sublattice  represented 
by  V  on  the  plane  labeled  by  n,  and  k  and  k  are  the  com¬ 
ponents  of  k.  parallel  and  perpendicular  to  the  surface 
(interface)  plane,  respectively.  This  basis  set  reflects  the 
symmetry  of  the  semi-mfinite  system  and  is  therefore  well 
suited  for  our  purpose.  In  the  following  we  suppress  the 
k  index  for  compactness  of  notation,  as  in 


Gin.n'}=  2  2  <<a<zv7ik  G  a'a'y'n'k  ) 

ao  >  u  i7  V 

X  aavnk  )  ( a'a'v'n  'k  ,  (3; 

which  is  the  GF  submatnx  between  layer  n  and  layer  n'. 

From  the  site-diagonal  GF,  Gin,ni,  we  may  calculate 
the  local  density  of  states 

p,d  F '  =  —  (  1 /rr  I  Im  Tr,G  i  n.n  '  ,  (4 

where  Tr,  indicates  that  the  trace  is  to  be  earned  out 
only  for  the  species  'layeri  given  by  v. 

The  GF  of  the  semi-infinite  crystal,  G  is  found 
through  the  application  of  the  Dyson  equation. 

G'  =  G-)-GFG'.  (5) 

Besides  calculating  the  matrix  elements  of  the  infini: 
crystal  GF,  G.  we  must  also  find  the  matrix  elements  of 
the  scattering  potential,  F,  introduced  by  the  creation  of 
the  surface  We  label  the  double  layers  m  the  crystal 
with  integers  such  that  the  cleavage  plane  passes  between 
the  double  layers  —  1  and  0.  and  the  semi-infinite  crystal 
of  interest  starts  at  the  zeroth  layer  and  has  layer  indices 
n  ^0.  Because  G'ln.n')  is  zero  for  all  n  and  n'  less  than 
zero,  the  only  matrix  elements  of  the  scattering  potential 
that  enter  into  Eq.  (5)  in  the  present  model  are  I'l  —  l.Oi 
and  F(0,  -  I  )  which  express  the  severing  of  the  interac¬ 
tions  across  the  cleavage  plane.  We  may  also  include  a 
diagonal  term  I'lO.Oi,  if  we  wish,  to  allow  for  surface 
perturbations  such  as  relaxation  and  environmental 
shifts,  into  our  calculation  These  scattering  potentials 
have  explicit  dependence  on  k  and  the  pair  of  indices  as¬ 
sociated  with  F  refer  to  the  double  layers  involved  m  the 
interaction.  In  terms  of  the  Hamiltonian  matrices,  the 
scattering  potential  matrix  is  given  by 

F  =  //-W,  (61 

where  N'  is  the  Hamiltonian  of  the  semi-infinite  crystal. 
The  scattering  potential  matrix  elements  describing  the 
breaking  of  interactions  across  the  cleavage  plane  are'^ 

F(0.-1)=  -//(O.-n  , 

("i 

Ft  -  1,0)=  -//(  -  1,0)  , 

where,  as  before,  the  k  index  has  been  suppressed  for 
compactness  of  notation.  The  matrix  element  FiO.O'  is 
introduced  parametrically  to  account  for  the  shifts  in  the 
atomic  levels  and  hopping  interaction  at  the  surface  lay¬ 
er. 

Explicitly,  a  general  matrix  of  G'  is  found  from  Eq.  (5) 
as 

G'tn,n')  =  Gin.n’) 

-[Cln.O)F(O.O) 

-t-Gm.  -  liFi  -  l.OijG'iO.n')  .  (8) 

with  n  and  n'tO  To  solve  this  equation,  it  is  first  neces¬ 
sary  to  solve  for  G'iO.n'i  which  is  done  by  setting  n  equal 
to  zero  in  Eq.  (8)  and  solving  for  the  matrix  element 
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C'iO,n'i  =  [l  -Ci0.0iU,0.0' 

-G'O,  -  1  T’i  -  l,0i]  '  'GiO.n'i  .  (9) 

To  find  the  genera!  matrix  element  of  G'  we  need  only  to 
substitute  Eq.  i9i  into  Eq.  (8).  To  include  environmental 
effects  on  the  surface  atoms,  we  introduce  a  parameter  5 
which  measures  a  shift  of  the  surface  layer  on-site  ener¬ 
gies.  Also,  we  introduce  a  parameter  d  which  represents 
the  strength  of  the  interactions  between  the  first  two 
planes  of  atoms  m  the  semi-infinite  crystal  relative  to  the 
same  interactions  in  the  bulk.  The  parameters  6  and  d 
are  included  in  the  matrices  F(0,0l  and  F(  —  1.  —  1 ). 

This  mode!  was  first  extended  to  the  interface  between 
two  mode!  semi-infinite  one-band  crystals  by  Yaniv.'^ 
We  further  extend  the  technique  to  real  crystals  using  the 
sp^  with  spin  basis  states.  The  interface  is  formed  be¬ 
tween  crystal  ,4  (n  E  —  1 )  and  crystal  B  (n  ^  0).  In  joining 
the  crystals,  the  interactions  between  the  Te  and  Cd 
planes,  as  well  as  those  between  the  Te  planes,  across  the 
interface,  are  taken  to  be  the  same  as  in  the  bulk.  Since 
there  are  no  data  available  for  the  hopping  integrals  be¬ 
tween  Hg  and  Cd  planes  we  assume  them  to  be  equal  to 
the  average  of  the  interactions  between  bulk-crystal  (100) 
Cd  planes  in  CdTe  and  interactions  between  bulk-crystal 
(ICX))  Hg  planes  in  HgTe.  The  interactions  between  the 
A  and  B  sides  of  the  interface  are  included  in  the  scatter¬ 
ing  potentials  U'lO,  —  1 1  and  F'(  —  1,0).  To  our  Green’s 
function  we  add  subscripts  a  and  0  which  take  on  the 
values  A  and  B  to  indicate  the  side  of  the  interface  in¬ 
volved  in  the  calculations.  Once  again  solving  the  Dyson 


equation  for  the  interface  GF,  g  iRef.  16), 
g  =C'-^G'Ug 

we  find  the  interface  GF  matrix  elements  as 

g^g(m,n)  =  G'^im,n)6^lj 

'G^lrn.OmO.  -  1  )g  ^g(  -  l.n}d^ 

G\im,  —  1  )y'i  —  l,0)ggg(0,n)d^^  ,  (10) 

where 

g„  ,(  -l.n)  =  [l-G.V-l.-l)F(-l,0)Ga(0.0) 

XK(0.-l)]“'c;<(-l,n)  ,  (11a) 

gaa(0.n)  =  [l-Ga(0.0)F(0.-l)G;,(  -1,-1) 

X  F( -l,0)]''Gfl(0.n)  ,  (11b) 

ga,(0,n)  =  Ga(0.0)K(0,-l)g„.,(-l,rt)  ,  (11c) 

g^a(-l.n)  =  GV-l.-I)^'(-1.0)gaB(0.n)  ,  (lid) 


and  5^g  is  the  Kronecker  delta.  We  have  assumed  a  per¬ 
fect  match  in  the  lattice  spacing  across  the  interface 
which  is  nearly  true  for  HgTe  and  CdTe  with  a  difference 
in  spacing  of  only  0.3%. 

In  the  evaluation  of  the  GF  of  the  bulk  system  a  great 
deal  of  numerical  integration  is  required.  The  integrals 
are  evaluated  through  the  use  of  a  Lagrange  interpolation 
scheme  in  which  the  time  to  calculate  the  band  structures 
is  reduced  by  evaluating  them  exactly  at  only  a  few  points 


in  k  space  and  interpolating  for  the  intermediate  values 
Accuracy  is  ensured  through  sampling  the  functions  at  a 
sufficient  number  of  points.  All  calculations  are  per¬ 
formed  with  a  small  positive  imaginary  component  in  the 
energy  and  the  final  results  at  the  real  energy  axis  are 
evaluated  through  the  use  of  the  analytic  continuation 
procedure  of  Hass,  Velicky,  and  Ehrenreich."' 

One  quite  useful  feature  of  our  calculation  is  that  the 
evaluation  of  the  GF's  of  the  semi-infinite  crystal  and  the 
interface  requires  much  less  computational  effort  than  the 
evaluation  of  the  infinite  crystal  GF.'"  As  a  result  of  this 
speed,  we  are  able  to  consider  several  ^.Jes  of  the  envi¬ 
ronmental  shift  and  surface- nearest-neighbor  interac¬ 
tion.  In  order  to  observe  the  effects  of  the  variation  of 
these  parameters  on  the  localized  states  we  examine  the 
local  densities  of  states  (LDOS)  at  various  values  of  the 
parameters  for  a  few  CdTe  and  HgTe  surfaces  and  inter¬ 
faces.  These  LDOS  are  evaluated  at  fixed  values  of  k  so 
that  we  may  find  the  positions  of  the  localized  state  bands 
in  the  surface  Brillouin  zone.  For  details  of  the  evalua¬ 
tion  of  the  matrix  elements  of  the  infinite  crystal  GF  and 
those  of  the  scattering  potentials  required  for  our  calcula¬ 
tion.  one  IS  referred  to  Refs.  15  and  16. 


En«rgy(#S0 

FIG.  1.  Dependence  of  the  LDOS  on  the  parameters  6  and  d 
as  compared  to  the  infinite  crystal  for  (a)  the  cation  surface  lav- 
er  and  (b)  the  first  interior  anion  layer  at  F  (k  =0)  for  CdTe 
cation-terminated  (100)  surface. 
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III.  RESULTS  AND  DISCUSSION 

Once  the  matrix  elements  of  the  infinite  crystal  GF  and 
the  relevant  scattering  potentials  are  specified,  one  can 
calculate  the  matrix  elements  of  the  GF  of  the  semi¬ 
infinite  sample  and  that  of  the  sample  with  the  interface 
using  Eqs.  (8)  and  (10),  respectively.  One  can  further  cal¬ 
culate  the  LDOS  at  various  layers  using  an  equation  of 
the  type  of  Eq.  (4).  In  this  paper  we  present  the  LDOS 
calculated  at  the  symmetry  points  F  and  J  and  at  ener¬ 
gies  near  the  valence-  and  the  conduction-band  edges  of 
the  sample  where  the  principal  band  gaps  occur. 

In  Figs,  1  and  2  we  plot  the  surface  LDOS  at  the  F 
point  for  various  combinations  of  the  environmental  shift 
parameter  6  and  the  geometric  shift  parameter  (relaxa¬ 
tion  parameter)  d  for  CdTe  cation-terminated  and  HgTe 
anion-terminated  crystals,  respectively.  We  begin  by  not¬ 
ing,  in  Fig.  1,  the  existence  of  a  localized  surface  state 
(bound  state)  which  is  the  sharp  structure  that  appears  in 
the  LDOS  within  the  band  gap  of  the  CdTe.  whether  or 


not  there  is  a  surface  perturbation.  The  ideal  cation- 
truncated  (100)  surface  of  CdTe  has  a  surface  state  at  F 
with  energy  lying  at  £=0.6  eV  above  the  top  of  the 
valence  band.  We  also  see  that  the  bulk  densities  of 
states  are  modified  as  a  result  of  the  introduction  of  a  sur¬ 
face  but  the  band  gap  of  CdTe  remains  unchanged.  We 
note  that  the  position  of  the  bound  state  is  sensitive  to 


FIG.  2.  Dependence  of  the  LDOS  on  the  parameters  6  and  d 
as  compared  to  the  infinite  cr-  stal  for  lal  the  anion  surface  layer 
and  ibl  the  first  interior  cat. on  layer  at  F  Ik  =0i  for  HgTe 
cation-terminated  t  lOOi  surface. 


FIG  3.  Dependence  of  the  LDOS  on  the  parameters  8  and  d 
as  compared  to  the  infinite  crystal  for  la)  the  anion  surface  layer 
and  <b)  the  first  interior  cation  layer  at  y  [k  =  rr/ol  v  2.  v  2 1]  for 
HgTe  anion-terminated  (100)  surface. 
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ELECTRONIC  STRL'CTCRES  OF  HgTe  AND  CdTe  SURFACES 


vanaiions  in  both  d  and  6  In  Fig  2.  we  notice  that  the 
surface  has  again  introduced  a  marked  change  in  the 
LDOS  of  HgTe.  A  resonance  state  appears  in  the  form  of 
a  spike  inside  the  conduction  band  of  semimeial  HgTe. 
The  contribution  of  this  resonance  to  the  LDOS  is 
strongest  for  anion  layers  indicating  that  anion  slates  are 
the  most  likely  constituents  of  this  state.  Similar  surface 
slates  have  been  calculated  by  Bryant'*  for  the  special 
case  w  here  no  surface  perturbations  exist. 

The  surface  states  calculated  for  the  HgTe  at  the  J 
point  where  there  exists  a  gap  are  shown  in  Fig.  3.  These 
states  exhibit  the  same  dependence  on  the  interactions 
that  was  seen  at  the  T  point.  Here  instead  of  a  single 
peak  we  see  a  pair  of  bound  states  just  above  the  valence 
band  within  the  band  gap.  Also  above  the  conduction 
band  there  is  a  bound  state  that  was  not  observed  for  the 
r  point.  Anions  contribute  more  strongly  to  the  bound 
states  inside  the  band  gap  while  the  cation  contribution  is 
stronger  to  the  bound  state  above  the  conduction  band. 
Also  notice  that  the  bulk  states  are  strongly  modified  by 


FIG.  4.  Comparison  of  the  surface  LDOS  of  the  cation- 
terminated  (100)  surface  of  CdTe  with  the  LDOS  plotted  (a)  at 
successive  cation  layers  and  (b)  anion  layers  for  the  CdTe  side  of 
the  (lOOl  interface  CdTe/HgTe  at  F. 


the  introduction  of  the  surface  and  that  the  band  gaps  are 
not  altered  by  the  surface. 

Our  calculation  of  the  layer  dependence  of  the  surface 
Slates  shows  that  these  states  become  progressively  less 
significant  as  we  examine  deeper  inside  the  crystal  indi¬ 
cating  these  states  are  indeea  localized  bound  states.  The 
bulk  LDOS  in  the  deeper  layers,  on  the  other  hand,  start 
resembling  the  infinite  crystal  LDOS. 

As  mentioned  before,  the  interface  calculations  have 
been  performed  assuming  no  lattice  mismatch  in  the  crys¬ 
tal  structures  of  the  HgTe  and  CdTe  compounds,  thus 
neglecting  all  strains  that  may  be  present  at  the  interface 
The  parameters  chosen  for  our  calculation  also  do  not  al¬ 
low  for  valence-band  oflTset.  With  these  assumptions, 
dramatic  effects  are  seen  at  the  interface  of  HgTe  and 
CdTe  as  shown  in  Figs.  4  and  5,  where  the  LDOS  in  the 
CdTe  side  of  the  interface  are  presented  at  F  and  J 
points,  respectively.  The  most  obvious  effect  is  the  lack 
of  localized  states  that  were  previously  seen  at  the  free 
surfaces  of  these  materials.  Instead,  we  find  that,  close  to 
the  interface,  the  bulk  states  of  HgTe  that  lie  in  the  re¬ 
gion  between  0  and  1.6  eV  appear  in  the  band  gap  of 


FIG.  5.  Comparison  of  the  surface  LDOS  of  the  cation- 
terminated  (1(X))  surface  of  CdTe  with  the  LDOS  plotted  lai  ai 
first  cation  layer  at  the  interface  and  (b)  the  first  interior  anion 
layer  for  the  CdTe  side  of  the  ( 100)  interface  CdTe/HgTe  at  J 
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CdTe  at  the  f  point.  These  states  in  the  CdTe  gap  be¬ 
come  less  significant  at  layers  further  from  the  interface 
while  at  the  same  time  the  LDOS  in  the  energy  ranges  of 
the  conduction  and  valence  bands  of  CdTe  increase  in 
strength.  Thus  we  have  a  narrowing  of  the  gap  of  CdTe 
in  a  limited  region  near  the  interface  which  allows  con¬ 
duction  electrons  to  penetrate  into  the  CdTe  from  the 
HgTe  over  a  short  distance.  In  Fig.  5,  we  explicitly  see 
how  the  gap  at  the  CdTe  layers  is  reduced  with  respect  to 
pure  CdTe  at  J.  Calculation  of  the  LDOS  in  the  HgTe 
side  of  the  interface  shows  similar  effects. 

The  above  results  describe  how  the  LDOS  change  from 
the  bulk  to  the  surface  and  then  from  the  surface  to  the 
interface.  While  the  bulk  and  the  interface  results  can  be 
considered  realistic,  the  surface  results  may  not  be,  since 
the  surface  reconstruction  has  not  been  included  in  the 
calculation.  Recent  experiments'  and  a  structural 

theory’'^  indicate  that  the  surfaces  of  CdTe  and  some  oth¬ 
er  II-VI  compounds  undergo  reconstructions  similar  to 
those  on  the  surfaces  of  III-V  compounds.  However, 
similar  measurements  are  yet  to  be  made  on  HgTe  sur¬ 
faces  and  CdTe/HgTe  interfaces.  Our  calculations  indi¬ 
cate  that  changes  in  the  electronic  structure  in  the  form 
of  environmental  shifts  at  the  surface  lead  to  only  minor 
changes  in  the  localized  surface  and  the  bulk  LDOS. 
Whether  a  surface  reconstruction  will  result  in  substan¬ 
tial  modification  of  these  states  is  yet  to  be  determined. 

In  conclusion,  we  have  described  in  this  paper  how  one 
can  incorporate  the  GF  method  of  KS  to  calculate  the 
surface  and  interface  structures  of  II-VI  compounds 


without  having  to  deal  with  the  artificial  super-cell  ap¬ 
proximation.  Our  calculation  shows  that  drastic  changes 
in  the  LDOS  can  occur  because  of  creation  of  surfaces 
and  interfaces.  Localized  surface  or  resonance  states  ap¬ 
pear  above  the  top  of  the  valence  band,  and  the  effects  of 
the  surface  on  the  LDOS  persist  at  several  layers  inside 
the  bulk.  At  the  interface,  there  are  no  localized  states 
but  the  bulklike  states  related  to  one  material  penetrate 
into  several  layers  inside  the  other  material  resulting  m 
narrowing  of  band  gaps  close  to  the  interface  for  the  wide 
band-gap  material.  At  present  there  are  no  systematic 
experimental  data  available  for  comparison  with  our  con¬ 
clusions. 

Finally,  even  though  we  have  not  included  all  aspects 
necessary  for  a  complete  description  of  the  surfaces  and 
interfaces,  our  work  can  be  considered  to  be  the  first  step 
toward  the  understanding  of  the  surface  and  interface 
electronic  structures  of  the  II-Vi  impounds  To  our 
knowledge,  the  results  preseniec:  '  e  are  new  for  the  in¬ 
terface  and  more  general  than  any  previous  surface  calcu¬ 
lations  on  the  11-VI  compounds.  Since  the  method  is 
flexible.  It  will  be  possible  to  incorporate  realistic 
structural  models  dealing  with  the  surface  reconstruc¬ 
tions,  when  they  become  available,  to  calculate  more  real¬ 
istic  electronic  properties  of  such  surfaces  and  interfaces. 
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Alloys  of  the  form  A,B^.  ,C  almost  always  have  a  different  surface  concentration  from  the  bulk 
in  order  to  maintain  a  constant  chemical  potential  for  each  layer  in  the  alloy.  We  have  calculated 
the  degree  of  surface  segregation  for  the  pseudobinary  alloys  Hg,  .  ,Cd,Te  and  Hg,  -,Zn^Te.  The 
enthalpy  respionsible  for  segregation  is  the  difference  in  the  energies  for  moving  an  .4  or  B  atom 
from  the  bulk  alloy  to  the  surface.  There  are  two  major  contributions  to  this  energy  process:  (Da 
bond-breaking  mechanism,  whereby  the  element  with  the  lowest  surface  energy  segregates  to  the 
top,  and  (2)  strain  release,  where  the  dilute  element  in  the  compound  segregates  to  the  surface  to  al¬ 
leviate  the  strain  energy  due  to  mismatch  of  the  AC  and  BC  bond  lengths.  In  our  segregation  mod¬ 
el,  the  free  energy  of  each  layer  is  calculated  in  the  regular  and  quasichemical  approximations.  By 
equating  the  chemical  potentials  of  each  successive  layer  to  the  bulk,  the  composition  of  each  layer 
IS  obtained.  Our  results  indicate  that  there  is  strong  surface  enrichment  of  Hg  in  Hg|-^Cd,Te 
while  there  is  less  surface  segregation  of  Hg  in  Hgi  -  ,Zn,Te  at  the  low  x  values  appropriate  for  in¬ 
frared  application.  Mercury  segregation  to  the  surface  will  lower  the  band  gap  or  may  turn  the  sur¬ 
face  into  a  semimetal,  thereby  affecting  the  passivation  of  the  surface. 


1.  INTRODUCTION 

Applying  such  probe  techniques  as  Auger  electron 
spectroscopy  (AES),  low-energy  electron  difraction 
(LEED),  and  ion-scattering  spectroscopy  (ISS)  allows  one 
to  determine  the  chemical  compositions  of  surfaces  with 
great  accuracy.  These  measurements'  have  shown  that 
the  chemical  composition  of  the  surface  is  different  from 
the  bulk  composition  tn  most  metal  alloys.  This 
phenomenon  of  surface  segregation  has  also  been  studied 
theoretically  in  great  detail  starting  with  the  work  of 
Gibbs,*  who  noted  that  the  surface  composition  will  be 
enriched  with  the  alloy  component  that  minimizes  the 
free  energy  associated  with  the  creation  of  the  surface 
(i.e.,  bond  breaking  at  the  surface).  From  Gibbs’s  adsorp¬ 
tion  equation  for  a  two  component  system  ( )  it 
has  been  shown’  that  the  surface  concentration  of  A 
atoms  (xj)  differs  from  that  of  the  bulk  (x^)  by  the  rela¬ 
tion 

-r^— =  — ^exp[(r5-r^  )£7/A:r]  ,  (1) 

where  y  ^ .  g ,  is  the  surface  tension  of  pure  A(B)  and  a  is 
the  average  area  per  atom. 

Application  of  Eq.  (1)  to  real  systems  does  not  provide 
good  quantitative  results  since  it  does  not  consider  any 
deviations  from  the  bulk  concentration  for  layers  beneath 
the  surface.  Multilayer  models  which  include  layer  com¬ 
positions  under  the  surface  were  formulated  by  Williams 
and  Nason^  among  others’  by  employing  a  regular  solu¬ 
tion  method.  Later  work  by  Kumar  et  extended 
the  surface  segregation  problem  to  take  into  account  non¬ 
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random  distribution  of  atoms  in  the  alloy  using  the  quasi¬ 
chemical  approximation  (QCA).*  Moran-Lopez  and  Fal- 
icov’  have  also  looked  at  the  segregation  problem  along 
with  surface  order  and/or  disorder  transitions  using 
Kikuchi's  method. 

The  primary  driving  force  for  surface  segregation  for 
an  alloy  4,5, can  be  thought  of  as  the  difference  in 
energy  on  moving  an  4  or  B  atom  to  the  surface.  As  not¬ 
ed  above,  one  contribution  is  the  bond-breaking  mecha¬ 
nism,  whereby  the  element  with  the  lowest  surface  free 
energy  segregates  to  the  top.  Experiments'  have  shown 
that  this  is  not  the  only  mechanism  that  causes  one 
species  to  prefer  to  be  on  the  surface.  In  dilute  alloys 
where  there  is  a  size  difference  in  the  component  species, 
the  dilute  element  in  the  compound  will  segregate  to  the 
top  to  alleviate  the  strain  energy  due  to  lattice  mismatch. 
An  example  of  this  is  seen  in  CuAu,  where  in  the  dilute 
Au  alloy  case,  Au  is  predominantly  on  the  surface.  ‘ 
Looking  only  at  the  bond-breaking  mechanism,  however. 
Cu  is  predicted  to  be  the  segregating  element  since  it  has 
the  lower  surface  tension.  It  can  be  explained  by  the 
strain  release  argument  since  the  Au  atom  is  approxi¬ 
mately  15%  larger  than  the  Cu  atom.  The  connection 
between  strain  release  and  surface  segregation  was  first 
pointed  out  by  McLean"  in  his  study  on  grain  boun¬ 
daries.  Incorporation  of  both  strain  energy  and  bond 
breaking  in  surface  segregation  studies  has  been  done  by 
Wynblatt  and  Ku. 

In  the  theoretical  and  experimental  work  done  so  far, 
the  materials  studied  have  primarily  been  transition  and 
noble  metals.  The  effect  of  surface  segregation  on  metals 
has  important  implications  when  dealing  with  catalysis, 
corrosion,  and  grain  boundaries.  In  this  paper,  however. 

5980 


f  1989  The  American  Physical  Society 


3?  SURFACE  SEGREGATION  IN  PSEUDOBINARY  ALLOYS  5981 


we  will  look  at  the  effects  of  surface  segregation  on  semi¬ 
conductor  alloys.  In  particular,  we  are  interested  in  the 
Il-VI  systems  Hg,  ,Cd,Te  and  Hg,  ,2n,Te.  The 
suitability  of  these  materials  for  infrared  detectors  is 
determined  to  a  large  extent  by  the  control  of  the  surface 
.  imposition  and  the  capability  for  passivation.  Because 
of  the  structural  weakness  of  Hg,  ,Cd,Te,  it  has  been 
suggested  that  Hg,  ,Zn,Te  may  be  better  suited  for  in¬ 
frared  detectors.  We  are  interested  in  how  the  structural 
and  electronic  properties  are  modified  by  the  presence  of 
surfaces  and  interfaces.  In  regards  to  surface  segregation, 
we  are  interested  in  e.xamining  the  extent  of  Hg  migration 
to  the  crystal  surface. 

In  the  first  part  of  this  paper  we  will  discuss  models 
that  describe  the  thermodynamics  and  amount  of  surface 
segregation  that  occur  in  pseudobinary  binary  alloys  of 
the  form  These  models  are  modifications  of 

the  regular  solution  by  Williams  and  Nason and  the 
QCA  theory  by  Kumar*"  ^  to  include  first-  and  second- 
nearest  neighbors  in  zinc-blende  structures.  We  note  that 
the  shortest  range  of  interaction  that  can  effect  the  atom¬ 
ic  distribution  in  a  bulk  pseudobinary  semiconductor  al¬ 
loy  is  the  second-neighbor  interaction  (e.g.,  A -A,  A-B, 
and  B  -B),  because  this  is  the  closest  distance  between  the 
alloying  atoms,  Thus,  in  the  QCA,  the  second-neighbor 
model  for  a  pseudobinary  alloy  is  equivalent  to  a  first- 
neighbor  model  for  the  usual  metal  binary  alloys.  While 
the  first-neighbor  interactions  (  A-C  and  B-C)  in  a  pseu¬ 
dobinary  alloy  have  no  effect  on  the  bulk  statistics,  their 
contributions  are  required  for  the  study  of  surface  segre¬ 
gation.  These  first-neighbor  terms  will  become  part  of 
the  “bond-breaking”  energies  which  may  become  the 
dominant  factor  for  surface  segregation  of  an  alloy  com¬ 
ponent.  Our  formalism  will  also  incorporate  the  release 
of  the  strain  energy  when  an  atom  positions  itself  on  the 
surface.  Section  III  presents  numerical  results  for  the 
layer  compositions  of  the  II-VI  alloys  Hg|_,Zn,Te  and 
Hg,-  _,Cd^Te.  We  will  adopt  the  experimental  and 
theoretical  mixing  enthalpies  quoted  in  Ref.  14  for  the 
second-neighbor  energy  parameters.  The  energies  re¬ 
quired  to  move  a  cation  from  the  alloy  bulk  and  place  it 
on  the  surface,  calculated  for  these  alloys  in  Ref.  15,  will 
be  used  as  the  bond-breaking  energies.  All  temperatures 
considered  will  be  above  the  critical  temperature  for 
phase  separation.  In  addition,  we  will  only  consider  sub¬ 
stitutional  segregation.  In  Sec.  IV  we  will  discuss  the  re¬ 
sults  and  give  ideas  for  further  work  on  this  topic. 

II.  THEORY 

Consider  a  zinc-blende  alloy  of  the  type  A„B,.^C, 
where  the  A  and  B  atoms  occupy  one  fee  lattice  while  the 
C  atoms  occupy  the  other  fee  lattice.  In  this 
configuration,  an  A  (or  B)  atom  is  surrounded  by  four  C 
nearest-neighbor  atoms  and  a  total  of  12  second-nearest 
neighbor  A  and  B  atoms.  Assuming  the  alloy  is  semi- 
infinite,  divide  the  alloy  into  layers  parallel  to  the  surface, 
where  each  layer  i  has  a  fractional  concentration  of  A 
atoms,  Jc,  (i  =  1  being  the  surface  layer).  The  layers  con¬ 
sidered  here  refer  to  layers  which  contain  the  alloying 
atoms  (A  and  B).  Note  that  for  the  zinc-blende  struc¬ 


ture,  a  layer  of  C  atoms  is  always  between  adjacent  alloy 
atom  layers.  Neglect  any  gas  vapor  effects  by  assuming  a 
vacuum  interface  with  the  surface 

To  determine  the  composition  distribution  at  thermal 
equilibrium  the  total  free  energy  of  the  system  F  is  mini¬ 
mized  with  respect  to  the  layer  concentrations  j  .v  ,  with 
the  constraint  that  the  total  number  of  .4  atoms  is  a  con¬ 
stant: 

2  X,  =  const  .  (2 1 

For  layers  deep  inside  the  alloy,  any  effects  of  the  surface 
are  negligible  and  the  concentration  is  that  of  the  bulk. 
x^.  Through  the  use  of  Lagrange  multipliers,  the  condi¬ 
tions  stated  above  can  be  written  as 


for  all  X,.  This  is  equivalent  to  saying  that  the  chemical 
potentials  of  each  layer,  defined  as  =3/'/3x,,  must  be 
equal  to  each  other: 

fiix,  )=fi(Xf, )  .  (4) 

We  will  write  the  total  free  energy  per  number  of  atoms 
in  a  plane  for  the  layers  system  as  a  sum  of  layer  contri¬ 
butions. 

f  =  2 -  <5) 

I  I 

where  U,  is  the  configurational  enthalpy  and  S,  is  the 
configurational  entropy  of  the  ith  layer.  As  will  be  seen 
later,  U,  and  S,  actually  depend  on  contributions  from 
neighboring  layers. 

In  the  regular  or  QCA  treatments,  the  interactions  in 
the  enthalpy  term  arise  from  pair  interactions  only,  the 
interactions  in  our  case  being  .4-C  and  B-C  for  the 
nearest  neighbor  and  A- A,  A-B,  and  B-B  for  the  second 
nearest  neighbor.  These  bond  enthalpies  are  written  as 
z',!  and  z,j  for  the  i-j  pair  interactions  at  the  surface  and 
below  the  surface,  respectively.  As  we  shall  see  later,  the 
distinction  between  surface  and  nonsurface  bond  enthal¬ 
pies  is  done  in  order  to  consider  the  effects  of  strain 
release. 

In  a  given  layer,  an  /4  or  B  atom  is  surrounded  by 

z'"=z;"+zV'+z;" , 

16) 

Z'"  =  Z["  +  ZV’  (surface)  , 

nearest-neighbor  C  atoms.  In  (6),  Z|’'  is  the  number  of 
nearest  neighbors  on  the  same  layer  and  Zj  '  and  Zg  '  are 
the  number  of  nearest  neighbors  on  the  top  and  bottom 
layers,  respectively.  Similarly,  for  the  second  neighbor 
A- A,  A-B,  and  B-B  interactions,  the  number  of  second- 
nearest  neighbors  at  and  below  the  surface  will  be  ex¬ 
pressed  as 

Z  =Zl+Zg-(-  Zj  , 

(7) 

Z=Zi^-^Zg  (surface). 
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Regular  solution.  In  the  context  of  the  regular  solution,  the  distribution  of  atoms  in  a  given  layer  is  assumed  to  be 
random  and  the  entropy  term  per  layer  is  simply  the  form  for  random  distribution  of  atoms  on  a  given  layer 

S,  = -/c  [x,lax, +( 1  — X,  )ln(  1  — X, )]  .  (8» 

Tne  enthalpy  for  the  first  layer  is  given  by 


+  +  +  2x,(l-x,  )£‘^a  +  ll-x,  l-tia] 


while  for  all  other  layers  the  enthalpy  is 

U.=  Zl"+  “  ^  '  Ke,c  +  (l-^,)£Bc] 


2='*'  Z„ 


+  X.(l-Xj  )Z^g 


AB  T'Exx 


4-(l-x^)(l-x^)eja]  . 


where 


Zj,  i  —i  1 

^L’  J-‘  (H) 

Zg,  y=/  +  l 

and  the  ~  terms  are  there  to  prevent  double  counting. 
For  those  layers  in  the  bulk,  the  free  energy,  fj,  is  ob¬ 
tained  from  Eqs.  (10)  and  (11)  by  setting  x, ^.,=x, 

,,,  z;"+z|-" 

^b-  ^ + 

+  y  [xh X X  +  2xj,(  1  -Xj,  )e  -f- ( 1  -Xj  ihga  1 

+  /c^x^lnx^  -I-  ( 1  -Xj  )ln(  1  -Xj, )]  .  (12) 

Now  that  we  have  the  total  free  energy  of  our  semi¬ 
infinite  alloy  we  can  use  our  condition  for  thermal  equi¬ 
librium  [Eq.  (3)]  to  relate  the  layer  concentrations  to  the 
bulk  layers  as 

■*1  ■**  tSF'‘*/kT 

- e  '  ,  (13) 

1-X,  1-Xfc 

where  the  enthalpy  of  segregation  AF“*  is  given  by 
AFr»  =  A£,-A£fl-ZiE^(l-2x,) 


Z+Zi  Zj+Zg 


x,-ZxJ  ,  (Ua) 


A£*«-2£  Z^x,  -  -  —■  (x,*,-f-x,_,)-ZXfc 


(;^1),  (14b) 


In  Eq.  (14).  the  interaction  parameter  is  defined  as 


with  e’  defined  similarly.  The  first  term  in  Eq.  (14a)  is 
given  by 

,  Zr"  ,  Zl 

A£,=  Z['  +-^  £xc--Zt"£xc--y£xx 

£  X  X  Zg 

+  z,+— - -  (16) 

2  2 

with  a  similar  term  for  A£j,.  Note  that  the  first-neighbor 
interactions  only  enter  the  surface  term,  A£f*.  Also  note 
that  the  terms  |x,  |  before  Eq.  (13)  are  an  arbitrary  set  of 
layer  concentrations  while  in  Eqs.  (13)  and  (14)  and  here¬ 
after  |x,  j  are  to  be  defined  as  the  equilibnum  layer  con¬ 
centrations. 

In  the  ideal  solution  case  (i.e.,  £=0)  only  the  surface 
layer  is  different  from  the  bulk  and  the  degree  of  surface 
segregation  is  determined  entirely  b\  the  term 
A£^  —AEg.  Comparing  Eqs.  (1),  (13),  and  (14a)  we  see 
that  A£^  —  AEg  is  related  to  the  surface  tension. 

QCA  solution.  For  a  nonzero  interaction  parameter 
[Eq.  (IS)],  A- A  and  B~B  bonds  are  favored  when  £  is  posi¬ 
tive  while  A-B  bonds  are  favored  when  £  is  negative. 
This  contradicts  the  entropy  term  in  the  regular  solution 
(Eq.  (8)],  which  was  formulated  with  the  assumption  of  a 
random  distribution  of  the  atoms  in  the  alloy.  The  QCA 
formalism  takes  into  account  this  nonrandomness  via 
short-range  disorder  parameters.  In  the  standard  QCA 
for  bulk  alloys,  only  one  short-range  disorder  parameter 
is  required.  As  pointed  out  by  Kumar,  ^  however,  when 
one  is  considering  different  layer  concentrations  the  dis¬ 
tinction  must  be  made  between  intralayer  and  interlayer 
short-range  disorder. 

For  the  QCA,  the  segregation  equation  becomes 
■*1  ■*<.  -as**/* 


where  AS,***  is  the  entropy  of  segregation.  Our  QCA  re¬ 
sult  for  the  enthalpy  of  segregation  is  basically  the  same 
as  that  for  Kumar^  except  that  only  the  lateral  interac¬ 
tions  on  the  surface  contain  the  term 

A£f*  =  A£,,  -AEg-Z^i’i  1  —  2x,  )a, 

Zog  Zf  +  Zg 

+  2z  -^+  — - -  [l-2(l-x,)i3,] 

2  4 

-Zxt,a^  I  ,  (18a) 
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SFr*=2s 


Zi  a.  \  Z  f  Z g  'P 


Zr^Zg 


tJC_  ,  |/3, 


Zi_x,a,  -  ZxgOg  I  ,  il8b) 

where  a,  and  p  are  intralayer  and  interlayer  short-range 
disorder  parameters.  These  short-range  disorder  parame¬ 
ters  give  a  measure  of  the  degree  of  randomness  in  the  al¬ 
loy.  As  u,  —1  and  —  1.  the  alloy  becomes  a  random 
alloy  and  the  enthalpy  of  segregation  term  reverts  back  to 
Eq.  (14).  These  parameters  can  be  obtained  from  the 
mass-action  laws,” 

-U  -1/ 

£  PabPba 

— --t-^ln  -  =0,  (19a) 

'  PaaPbb 

e  ,,  PiB  Pba 

— --^ln  - ; - T  =0.  (  9b) 

Pa  a  Pbb 

where  p'^y  **  probability  that  an  X  atoms  is  on  the  ith 
layer  and  a  Y  atom  is  in  an  adjacent  position  on  the  jth 
layer.  For  the  surface  layer  (  /  =  1 )  e  is  replaced  by  e‘  in 
(19).  The  probabilities  are  summarized  below: 


P  ^  4  -  ■«.[  1  -  '  1  -  T  'a.  )  ■ 

P'ii  '  =-«  [1  1  -  T  - 1  1  . 

Pbb  = '  I  )(  1  ” -r,a  )  . 

Pbb'  ‘  >  1  -  x  h  1  -  .ic^ .  I  , 

■20' 

P'4B  '■-*,)  1  ><2,  . 

P'ib'  '  =•*,'  1  “  f  -  1  iP:  . 
pit  =  1  -J).  'Q.  . 

-l'  1  '•*;  >1^,  ■ 

Note  that  additional  interlayer  interaction  short-range 
parameter  Since  concentration  varies  with  layer,  two 
interlayer  short-range  disorder  parameters  are  required 
However,  the/J'  is  related  loP,  from  the  constraint 

=  ,  i:i' 

where  S'^y  is  the  number  of  X-Y  pairs  when  X  is  on  the 
Ith  layer  and  Y  is  on  the  yth  layer  and  .V'^  is  the  number 
of  A  atoms  on  the  ith  layer.  See  Refs.  6  and  7  for  com¬ 
plete  details. 

The  entropy  of  segregation  term  is  the  same  as  that  de¬ 
rived  in  Ref.  7  and  is  given  by 


AS^VA:  =  Z  In 


-Z'ln 


Zr  +  Zg 


,  Paa  ,  Pbb 
In  - r  —  In  - r- 

r,  '  '  *  '  p,  <  "  •  ‘ 

Pba  Pba 


■^P,{l-X,^^)g%-p,..^X,_^g‘/B' 


where  Z'  is  the  total  number  of  second-nearest  neighbors 
at  layer  i  and  h \g  and  g‘j^B  are  defined  as 


A»'.«  =  ln 


+  a,(l-2j:,)g"g  . 


'p'iBP'dA 


As  the  short-range  disorder  parameters  approach  1  and 
the  entropy  of  segregation  term  goes  to  zero. 

III.  RESULTS 

Numerical  results  are  now  presented  for  the  (111)  face 
that  terminates  with  the  cation  surface  of  Hg|_jCdj,Te 
and  Hg|_^ZnjTe  for  temperatures  are  the  critical  tem¬ 
perature  for  phase  separation.  For  the  dll)  face,  the 
second-nearest -neighbor  coordination  numbers  are 
Z  =  12,  Zi^=6,  and  Zy  =  ZB  =  3.  Since  accurate  values 
for  the  surface  tension  of  a  solid  are  difficult  to  obtain, 
workers  in  metal  alloys  have  related  this  term  to  the  heat 
of  vaporization  (A//''*'’)  via^ 

Zj 


As  Wynblatt  and  Ku’^ ”  have  pointed  out,  however,  this 
relation  overestimates  the  surface  energy.  [For  the  (lOOi 
fee  face,  the  overestimation  is  a  factor  of  2.]  The  empiri¬ 
cal  relationship 

X,a  =  A//'*'’/6  (26) 

has  given  better  numerical  results  for  liquid  metals.'  In 
our  study  of  semicon?  i  tor  alloys,  however,  we  have 
used  the  vacancy  formation  energies,  calculated  by  Berd- 
ing  et  al.,'^  for  AE^u,.  This  vacancy  formation  energy 
is  defined  as  the  energy  required  to  take  an  atom  from  the 
bulk  and  place  it  on  a  particular  surface.  These  quanti¬ 
ties  have  been  calculated  using  Harrison's  tight-binding 
theory'”  with  corrections  made  for  the  long-range 
Coulomb  energies  resulting  from  charge  redistribution 
and  rehybridization  of  the  dangling  bonds.  Unlike  other 
authors  who  have  used  concentration-independent 
and/or  empirical  parameters,  the  vacancy  formation  en¬ 
ergy  from  Ref.  15  is  concentration  dependent.  The 
values  for  Hg|_^Zn,Te  are  given  by 

AEHg=-(0.23;cj,-b0.89)  eV  , 

(27) 

A£z„  =  -(0.26Xfc -1-0.98)  eV  , 
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and  for  Hg,  .,Cd^Te 


A£Hg=- (0.14x^+0.891  eV  , 
A£cd  =  -(0. 14xi,  +  1.02)  eV  . 


(28) 


The  pair  interaction  terms  for  the  bulk  (e)  and  surface 
(£*)  are  provided  by  Chen  and  Sher,'^  who  determined 
these  parameters  from  substitutional  energies  when  an  A 
or  B  atom  is  removed  from  the  bulk.  These  substitution¬ 
al  energies  are  calculated  using  Harrison’s  bonding 
theory'*’  along  with  the  valence-force-field  model.” 
Effects  incorporated  in  the  substitutional  energies  include 
strain  and  chemical  terms  that  arise  as  the  alloy  relaxes 
upon  removal  of  an  atom.  For  our  surface  interaction 
parameter,  however,  the  strain  term  in  the  substitutional 
energy  is  set  to  zero  to  allow  for  strain  release.  Our  ap¬ 
proach  to  adding  strain  effects  differs  from  that  of  Wyn- 
blatt  and  Ku.  Their  entha'py  of  segregation  was  ob¬ 


tained  by  taking  Eq.  (14)  and  adding  a  term  denved  by 
Friedel'*  who  used  elastic  continuum  theory  to  calculate 
elastic  strain  energy  in  a  dilute  alloy.  They  did  not 
differentiate  between  surface  and  bulk  interaction  param¬ 
eters  and  considered  only  the  top  surface  as  being 
different  from  the  bulk. 

With  these  parameters,  numerical  layer  compositions 
were  calculated  using  Eqs.  (13)  and  (14)  for  the  regular 
solution  and  Eqs.  (17)  and  (18)  for  the  QCA  solution. 
The  assumption  was  made  that  a  certain  layer  \  is  the 
bulk  layer  and  those  layers  above  the  A'th  layer  (i  <A') 
will  have  different  layer  compositions  than  the  bulk.  A 
total  of  N  nonlinear  coupled  equations  were  then  solved 
for  the  layer  compositions.  In  order  to  make  sure  enough 
layers  were  taken,  the  calculation  was  repeated  for 
A’  =A'  +  1  until  convergence  close  to  the  bulk  value  was 
reached.  For  most  systems  and  temperatures  presented 
in  this  report,  four  layers  was  enough.  For  most  of  our 


FIG.  1.  Fractional  surface  concentration  (x, )  of  CdTe  as  a 
function  of  the  fractional  bulk  CdTe  concentration  (x^)  at  the 
(111)  face  of  Hg| _  ,Cd,Te.  Results  are  from  the  quasichemical 
approximation  using  an  experimental  '■)  and  theoretical  value 
(,j.)  for  the  bulk  interaction  parameter,  (a)  7=1000  K.  (b) 
r  =  600K 


FIG.  2.  Fractional  surface  concentration  (x, )  of  ZnTe  as  a 
function  of  the  fractional  bulk  ZnTe  concentration  (x^ )  at  the 
(111)  face  of  Hg|_,Zn,Te  at  temperatures  of  (a)  7  =  1000  K 
and  (b'  7  =  600  K.  All  nonideal  curves  were  calculated  in  the 
quasichemical  approximation  with  the  theoretical  interaction 
parameter  from  Chen  and  Sher  U/k  =  62  94  K ). 
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cases  we  have  looked  at,  the  regular  and  QCA  formalisms 
gave  similar  results,  so  only  the  QCA  are  presented.  This 
is  borne  out  when  one  calculates  the  short  range  disorder 
parameters,  which  were  always  close  to  one  for  all  in¬ 
teraction  parameters  and  temperatures  considered. 

Figures  1(a)  and  Kb)  show  the  results  for  Hg,  ^^Cd,Te 
at  temperatures  of  1000  and  600  K.  For  the  bulk  interac¬ 
tion  parameter,  the  theoretical  value  from  Chen  and 
Sher'*  is  t/k  =  —2.92  K,  while  an  experimental  value'*" 
of  t/k  =58.75  K  is  also  shown  for  comparison.  For  both 
cases  the  surface  interaction  parameter  is  t^/k  =  —  4.23 
K,  calculated  the  same  way  as  the  theoretical  bulk  value 
but  with  the  strain  term  set  equal  to  zero.  Since  the 
theoretical  parameter  is  small,  both  regular  and  QCA  re¬ 
sults  are  very  close  to  the  ideal  case.  The  layers  concen¬ 
trations  beneath  the  surface  are  very  close  to  the  bulk  so 
are  not  shown.  As  expected,  all  figures  show  that  for  all 


FIG.  3.  Fractional  surface  concentration  (x, )  of  ZnTe  as  a 
function  of  the  fractional  bulk  ZnTe  concentration  (x»)  at  the 
(111)  face  of  Hg,-,Zn,Te  at  temperatures  of  (a)  T  =  1000  K 
and  (b)  7  =  755,7  K.  All  nonideal  curves  were  calculated  in  the 
quasichemical  approximation  with  the  experimental  interaction 
parameter  iz/k  =  125.96  K). 


alloy  concentrations  the  component  with  the  weaker 
bonds  (Hg)  has  segregated  to  the  surface.  Also  note  that 
as  the  temperature  increases  the  amount  of  segregation 
decreases  because  the  entropy  term  (  —  T AS)  is  now  dom¬ 
inating  to  create  more  disorder  (i.e.,  less  segregation). 

Presented  in  this  paper  are  results  for  Hg,^^Zn^Te  us¬ 
ing  both  an  experimental  interaction  parameter,  t/k 
=  125.96  K,  and  a  theoretical  value  t/k  =62.94  K  In 
all  cases  the  surface  interaction  parameter  is  set  at 
t^/k  =6.04  K.  Figures  2(a)  and  2(b)  show  the  segrega¬ 
tion  curves  using  the  theoretical  interaction  parameter  at 
temperatures  of  1000  and  600  K.  As  with  the 
Hg,_,Cd,Te  case,  Hg  segregates  to  the  surface  layers 
with  the  amount  of  segregation  increasing  with  decreas¬ 
ing  temperature.  In  addition,  the  layers  below  the  sur¬ 
face  begin  to  show  appreciable  deviations  from  bulk  con¬ 
centrations  as  the  temperature  decreases.  In  order  to 
show  the  effect  of  strain  release  in  our  calculations,  each 
figure  also  contains  calculated  results  where  the  surface 
interaction  parameter  is  set  equal  to  the  bulk  interaction 
parameter.  Note  that  in  each  case,  the  amount  of  segre¬ 
gation  due  to  strain  release  is  reduced  in  the  HgTe-rich 
side  and  to  a  smaller  degree  the  amount  of  segregation  is 
enhanced  in  the  ZnTe-rich  side.  This  confirms  the  state¬ 
ments  of  Wynblatt  and  Wu'^  '^  in  regards  to  segregation 
in  dilute  compounds. 

Figures  3(a)  and  3(b)  show  the  results  using  the  experi¬ 
mental  interaction  parameter  at  7  =  1000  and  755.7  K. 
The  last  temperature  is  the  critical  temperature  for  mis¬ 
cibility  calculated  in  the  regular  solution  theory 
(Tf  =  Zt/2k).  As  expected,  the  trends  are  the  same  as 
with  the  previous  case.  More  segregation  is  observed  be¬ 
cause  of  the  increased  value  of  t/kT. 

IV.  DISCUSSION 

Models  for  surface  segregation  within  the  framework 
of  the  regular  and  quasichemical  approximations  have 
been  applied  to  the  Il-VI  compounds  Hg,_j,Cd,Te  and 
Hg,_j,Zn^Te.  As  mentioned  in  the  introduction,  it  has 
been  suggested  that  Hg,_j,Zn;,Tc  might  be  a  better  can¬ 
didate  for  an  infrared  material  than  Hg, _;,Cd;,Te.  In  re¬ 
gards  to  surface  segregation,  our  results  show  this  to  be 
the  case.  Let  us  compare  the  amount  of  Hg  enrichment 
at  the  surface  around  the  bulk  concentration  x„ 
=0.1-0. 3,  the  typical  region  for  infrared  applications. 
Referring  back  to  Figs.  1-3,  for  any  given  temperature, 
more  Hg  segregation  is  seen  in  Hg,-  ^CdjTe  than  in 
Hg,_,Znj,Te,  with  the  effects  of  strain  release  providing 
even  less  Hg  segregation  in  the  Hg-rich  side  of 
Hg,_j,Zn,Te.  Figure  4  shows  the  layer  concentrations  at 
the  growth  temperature  of  973  K  for  Hgo  g^Cdo  2oTe  and 
Hgo  gsZng  ijTe.  A;  these  concentrations,  both  have  ap¬ 
proximately  the  same  band  gap.  The  experimental  in¬ 
teraction  parameters  were  used  in  the  calculations.  The 
reduction  in  Cd  at  the  surface  layer  for  Hg^  joCdo  2oTe  is 
65%  while  the  reduction  in  Zn  at  the  surface  of 
Hgo  jsZno  ijTe  is  only  28%. 

In  general,  let  us  reiterate  the  basic  conclusions  that 
have  come  about  from  our  studies. 

(i)  The  amount  of  surface  segregation  on  the  top  layer 
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FIG.  4.  Layer  concentrations  at  the  growth  temperature  of 
973  K  for  Hgo  goCdo  :oTe  and  Hgo  gsZno  uTe.  Experimental  in¬ 
teraction  parameters  were  used. 

depends  primarily  on  the  difference  in  the  vacancy  forma¬ 
tion  energies,  A£^  —^Eg,  which  is  the  difference  in  the 
energies  required  to  move  the  constituent  atoms  from  the 
bulk  to  the  surface  in  the  alloy.  As  expected,  the 
weaker-bonding  elements  segregate  to  the  top  layers. 

(ii)  The  strength  of  the  second-neighbor  term  t/kT 
determines  how  far  from  the  ideal  case  the  curves  will  be 
and  how  many  layers  below  the  surface  some  appreciable 
amount  of  segregation  will  occur. 

(iii)  The  effect  of  strain  release  on  segregation  is  deter¬ 


mined  in  this  formalism  by  the  difference  in  our  surface 
and  bulk  interaction  parameters.  As  with  dilute  alloy 
cases,  if  A  is  the  atom  that  goes  to  the  surface  most  easily 
in  the  absence  of  strain,  then  strain  release  in  ^4,^,  ,,C 
pseudobinary  alloys  will  reduce  segregation  at  the  BC- 
rich  side  and  enhance  it  somewhat  at  the  /4C-rich  side. 

The  models  presented  here  have  considered  only  pair 
interactions.  Cluster  models,  where  the  alloy  is  divided 
up  into  microclusters  of  arbitrary  size  and  all  interactions 
within  the  cluster  considered,  are  better  suited  for  zinc- 
blende  structures  and  provide  a  measure  of  short-range 
correlation  in  the  alloy.  ”  The  application  of  a  cluster 
model  in  surface  segregation  studies  should  provide  more 
exact  results,  especially  at  temperatures  below  the  critical 
temperature  for  order  and/or  disorder.  However,  to  pro¬ 
gress  beyond  predictions  of  trends  more  accurate  esti¬ 
mates  of  the  segregation  interaction  parameters  are  need¬ 
ed. 

While  our  results  for  II-VI  alloys  have  agreed  qualita¬ 
tively  with  metallic  systems  in  regards  to  their  surface  be¬ 
havior,  we  have  not  found  any  specific  surface  segrega¬ 
tion  experiments  or  semiconductor  alloys  to  compare 
with.  We  call  for  experimentalists  to  look  into  this 
phenomenon  in  this  important  class  of  alloys. 
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ABSTRACT 


We  argue  that  passivation  of  HgCdTe  and  HgZnTe  is  related  to  tendencies 
of  the  alloy  constituents  (and  impurities)  to  segregate  to  interfaces.  In  the 
worst  case,  the  surface  is  so  Hg-rich  that  it  is  a  semimetal.  We  present  a 
model  that  offers  insight  into  the  mechanisms  driving  segregation  for  these 
alloys  to  vacuum,  CdTe,  ZnTe,  ZnS,  and  various  oxide  interfaces.  The 
crystal-orientation-depcndence  of  the  effect  is  also  discussed.  We  con¬ 
clude  that  segregation  is  minimized  by  using  CdTe  or  2jiTe  as  the 
passivanL 


I  INTRODUCTION 


This  paper  presents  a  model  of  the  behavior  of  various  passivants  used  on 
Hgi-xCdxTe  and  Hgi.xZnxTe  alloys.  The  model  is  capable  of  explaining  many  of  the 
phenomena  that  impact  surface  leakage,  which  surfaces  are  passivated  most  easily,  and 
how  various  passivants  modify  active  device  elements.  The  model  treats  segregation  of 
the  alloy  constiments  at  interfaces  and  is  extendable  to  impurities.  In  all  alloys,  the  equi¬ 
librium  concentration  is  never  constant  from  the  bulk  to  an  interface  (particularly  a  free 


*  This  paper  was  presented  as  an  added  paper  at  the  1989  meeting. 
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surface);  one  species  always  segregates  preferentially.  1  The  interface  concentration  gen¬ 
erally  returns  to  its  bulk  value  in  a  few  atomic  spacings,  depending  on  the  material  in  the 
passivating  coating  and  the  effective  temperature  at  which  the  equilibrium  is  established. 
Segregation  of  this  sort  is  well-documented  in  metal  alloys,  2.3  but  has  received  little 
attention  in  semiconductors.  Concepts  developed  in  the  metals  literature  have  been 
incorporated  into  our  formalism.  At  this  point  .many  quantitative  details  of  the  model  are 
still  incomplete;  however,  we  have  identified  the  major  underlying  physical  phenomena 
and  trends  among  a  broad  range  of  effects  can  be  deduced  fiom  the  model’s  logic 
structure. 


n  DISCUSSION 


The  vacuum  surface  of  an  Ai.xB^C  alloy  will  be  discussed  first.  For  definiteness, 
consider  a  (1 1 1)  A  surface.  Under  ideal  circumstances,  this  surface  is  cation- terminated, 
and  each  cation  makes  three  bonds  to  the  underlying  anion  layer  and  having  one  dangling 
hybrid  pointing  to  the  vacuum.  In  our  calculation,  the  solid  is  thought  of  as  a  collection 
of  anion  and  cation  layer  pairs.  ^  Interactions  among  the  atoms  in  each  layer  and  between 
the  layers  are  summed  to  get  the  enthalpy  of  the  solid.  The  entropy  is  also  evaluated 
using  a  new  quasichemicai  combinatorial  method.*^  The  free  energy  is  miniirized  to  find 
the  composition  xj,  i  =  1,2,3,...  of  each  layer.  Details  of  this  method  can  be  found  in  prior 
publications.^-^ 


^Patrick,  R.S.,  A.-B.  Chen,  A.  Sher,  and  M.A.  Herding,  1989;  “Surface  Segregation  in 
Pseudobinary  Alloys,”  Phys.  Rev.  B,  Vol.  39.  No.  9,  pp.  5980-5986.  Reference  to  much  of  the 
relevant  literature  of  this  subject  can  be  found  in  this  paper. 

2  WiUiams,  F.L.,  and  D.  Nason,  1974;  Surf.  ScL,  Vol.  45.  p.  377. 

3  Kumar,  V.,  D.  Kumar,  and  S.K.  Joshi,  1979;  Phys.  Rev.  B,  Vol.  19,  p.  725. 

^  Sher,.  A.,  M.  van  Schilfgaarde,  A.-B.  Chen,  and  W.  Chen,  1987;  “Quasichemicai 
Approximations  in  Binary  Alloys,”  Phys.  Rev.  B,  Vol.  36.  No.  8,  pp.  4279-4295  (September). 
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Two  major  effects  drive  segregation.  The  first,  referred  to  as  the  chemical  driving 
energy,  is  related  to  the  fact  that  the  solid  can  minimize  its  enthalpy  by  having  the  con¬ 
stituent  on  the  surface  that  makes  the  weaker  bond.  However,  having  aU  of  one  class  of 
atoms  on  the  surface  is  unfavorable  to  the  entropy.  Hence,  the  surface  concentration  is  a 
compromise.  The  second  driving  mechanism  is  strain  release.  An  impurity  that  has  a 
bond  length  mismatched  with  the  major  constituent  has  less  strain  energy  on  the  surface 
than  in  the  bulk.  Hence,  the  low  concentration  constituent  of  a  bond-length-mismatched 

alloy  tends  to  go  to  the  surface,  e.g.,  in  Hgi_xZnxTe  for  x  >  0.15,  the  Zn  is  driven  to  the 
siuface.  Near  x  =  0.5,  neither  species  is  preferentially  driven  to  the  surface  by  strain 
release,  and  in  the  high-concentration  region  the  opposite  species  goes  to  the  surface.  If 
the  low  concentration  species  also  makes  the  weak  bond,  then  the  two  mechanisms  rein¬ 
force  one  another  and  enhance  segregation;  in  the  opposite  case,  they  compete  and  seg¬ 
regation  is  reduced.  In  Hgi_xZnxTe  for  x  <  0.25,  the  mechanisms  compete  so  as  to 
reduce  segregation.  Strain  release  plays  Uttle  role  in  Hgi_xCdxTe,  because  the 
components  are  nearly  bond-length-matched. 

In  both  Hgi_xCdxTe  and  Hgi_xZnxTe,  the  equilibrium  surfaces  are  Hg-rich  relative 
to  the  bulk.  The  layer  concentration  renims  almost  to  that  of  the  bulk  by  the  second 
layer.  However,  in  both  cases,  for  x  in  the  range  used  for  long-wave  detectors,  the 
surfaces  are  semimetals  that  look  n-type.  Thus,  any  passivation  procedure  must  begin  by 
carefully  removing  any  equilibrium  surface  layers.  Rgures  1  and  2  show  the  equilibrium 
vacuum  surface  concentration  versus  the  bulk  concentration  for  a  sample  equilibrated  at 
973  K.  If  a  lower  temperature  had  been  used,  the  segregation  would  have  been  more 
severe.  One  problem  with  HgCdTe  may  be  that  diffusion  persists  to  low  tcmperanire; 
hence,  the  effective  equilibration  temperature  may  be  low. 

The  (1 1 1)  A  surface  has  one  dangling  cation  bond.  The  (110)  surface  also  has  one 
dangling  cation  bond,  but  the  bond  density  is  higher  than  on  the  (1 1 1)  A  surface.  Hence, 
the  (1 10)  surface  segregates  slightly  more  than  (1 1 1)  A.  However,  there  are  two  and 
three  dangling  bonds  per  cation,  respectively,  on  the  (100)  and  (1 1 1)  B  surfaces.  Hence, 
those  regions  of  the  surface  that  are  cation-terminated  should  be  progressively  more  diffi¬ 
cult  to  passivate,  at  least  in  Hgi_xOixTe,  where  the  chemical  term  dominates.  However, 
particularly  for  the  (1 1 1)  B  surface,  a  fair  portion  of  the  surface  will  be  Te-terminated. 
e  underlying  cations  will  then  make  four  bonds  to  Te,  just  as  the  bulk  atoms  do.  Thus, 
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passivation  should  be  easy  in  these  regions.  Because  the  Te- terminated  patches  are  easy 
to  passivate,  and  because  the  cation- terminated  patches  are  difficult  to  passivate,  this  is 
another  source  of  variability  for  imaging  arrays. 

The  way  these  effects  are  modified  by  CdTe  and  ZnTe  capping  layers  is  also  shown 
in  Figs.  1  and  2.  For  Hgi_xQlxTe,  the  segregadon  in  Fig.  1  is  almost  completely 
removed  by  both  passivants.  The  reason  is  that  the  cadons  at  the  interface  between  the 
HgCdTe  and  the  capping  layer  now  make  four  Te  bonds,  just  as  they  do  in  the  bulk.  As  a 
consequence,  the  only  energy  difference  between  a  Hg  or  Cd  in  the  bulk  and  at  the  inter¬ 
face  stem  from  small  metallizadon-induced  local  bond  and  small  back-bond  changes. 

The  effect  of  these  passivants  on  Hgi-xZ^Te  seen  in  Fig.  2  is  less  complete  because  of 
the  strain  release  contribudon.  Here,  the  ZnTe  leaves  the  interface  slighdy  Zn-rich,  while 
CdTe  leaves  the  interface  slightly  Hg-rich  rcladve  to  the  bulk.  However,  both  passivant 
materials  sdll  are  reasonably  effeedve. 

The  results  in  Rgs.  1  and  2  are  ideaUad:  They  are  calculated  as  if  the  CdTe  and 
ZnTe  passivadon  layers  stay  fixed  at  the  equilibradon  temperature,  973  K  in  these  fig¬ 
ures.  This  is  unrealisdc,  the  CdTe  will  certainly  interdiffuse  with,  for  example,  the 
HgCdTe  to  leave  a  graded  concentradon  region  at  the  interface  that  is  Cd-rich.  This  grad¬ 
ing  would  have  a  range  determined  by  the  dme  diffusion  is  permitted  to  occur  and  could 
be  many  layers  thick.  On  the  other  hand,  the  ZnTe  will  not  interdiffijsc  into  the  HgCdTe, 
if  after  it  is  deposited,  all  processing  remains  below  the  cridcal  spinodal  transidon 
temperature.  Thus,  ZnTe  may  prove  to  be  a  more  temperature-stable  passivant.  Only  a 
few  atomic  layers  of  ZnTe  are  needed  to  serve  as  the  passivant;  accordingly,  strain- 
induced  misfit  dislocations  can  be  avoided.  If  a  thicker  insulator,  or  a  protective  layer  is 
needed  (rather  than  just  a  layer  to  cut  down  surface  leakage),  CdTe  or  some  insulator  can 
be  put  on  top  of  the  ZnTe  to  finish  off  the  passivation. 

The  effect  of  ZnS,  Photox,  Si02,  or  anode  oxides  can  also  be  deduced  from  this 
model.  Take  ZnS  as  an  example.  Now  on  the  (1 1 1)  A  surface,  the  extra  cation  bond  is 
made  to  a  sulfur.  The  CdS  and  HgS  bonds  arc  much  stronger  than  CdTe  or  HgTe,  respec¬ 
tively,  and  the  CdS  bond  is  much  stronger  than  HgS.  Thus,  the  CdS  bond  dominates  the 
behavior,  and  Cd  preferentially  segregates  to  the  interface.  While  this  is  preferable  to 
having  a  Hg-rich  interface,  because  it  opens  the  bandgap  rather  than  narrowing  it,  other 
troublesome  side  effects  may  be  caused.  We  have  not  yet  discussed  dopant 
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SURFACE  CONCENTRATION  OF  ZnTe  P  INTERFACE  CONCENTRATION  OF  CdTe 
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IRE  1  INTERFACE  CONCENTRATION  OF  Cd  IN  Hgi.,Cd,T«  FOR  VARIOUS 
PASSIVATING  CAPPING  MATERIALS 


FIGURE  2  INTERFACE  CONCENTRATION  OF  ZnTe  IN  Hg Zn, Te  FOR  VARIOUS 
PASSIVATING  CAPPING  MATERIALS 
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(nonisoelectronic  impurity)  segregation.  If  the  dopants  preferentially  segregate  to  the 
interface,  that  too  can  cause  high  surface  leakage.  Most  impurities  also  make  stronger 
bonds  to  S  than  to  Te,  and  so  will  tend  to  segregate  to  the  interface.  No  quandtadve  anal¬ 
ysis  of  these  effects  have  been  done  yet,  but  the  direcdon  of  trends  is  clear. 

Oxides  will  behave  like  the  sulfides,  but  the  likelihood  of  epitaxial  interfaces  over 
substandal  regions  is  smaller.  Hence  at  a  (1 1 1)  A  interface,  for  example,  where  the  bond 
density  of  the  semiconductor  and  capping  layer  are  grossly  different,  one  expects,  a 
distribudon  of  cation-oxygen  and  dangling  bonds.  Thus,  segregation  will  be  driven  by 
the  statistical  average  of  these  effects  over  a  given  domain.  If  different  domains  have 
different  oxide  crystal  orientations,  stoichiometries,  or  (in  the  native  oxide  case)  different 
compositions  (e.g.,  tellurium  oxide,  cadmium  oxide),  or  mercury  oxide,  then  there  will  be 
segregation  patterns  over  the  Hgi_xCdxTe  surface  with  different  cd  and  dopant 
concentrations.  As  long  as  the  interface  is  always  C!d-rich,  this  may  cause  miiumal 
problems,  but  it  is  bound  to  affect  the  unifonnity  of  array  performance. 


m  SUMMARY 


The  model  presented  here  is  incomplete,  but  broad  guidelines  for  Hgi_xCdxTe  passi¬ 
vation  can  be  stated: 

•  The  top  few  layers  of  material  must  be  removed  before  deposition  of  a  passiva¬ 
tion  layer. 

•  Te-beaiing  compounds  are  the  best  materials  choice  to  minimize  interface  con¬ 
centration  profiles. 

•  Epitaxial  layers  are  best,  but  if  polycrystalline  materials  are  used,  they  should  at 
least  be  properly  oriented  polycrystals. 

•  ZnTe  may  be  more  stable  against  interface  thermal  degradation  than  CdTe. 

•  The  easiest  orientations  to  passivate  are  the  (1 1 1)  A  and  (110),  and  progres¬ 
sively  harder  ones  are  cation  patches  on  the  (100)  and  (1 1 1)  B  surfaces  and  on 
these  latter  surfaces  careful  treatments  to  avoid  cation  patches  may  be  espe¬ 
cially  helpful. 

•  Nothing  has  been  said  about  radiation  damage  hardness  yet,  but  the  model 
obviously  offers  a  logic  structure  in  terms  of  which  to  understand  the  question. 
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Abstract.  The  band  structures  of  HgTe.  CdTe,  ZnTe,  the  alloys  Hg,  _  ,Cd.Te  (hct) 
and  Hg,  _  .2n,Te  (hzt),  and  several  small-gap  superlatlices  (sl)  are  calculated 
using  a  tight-binding  model  Our  calculations  show  a  nearly  linear  dependence  of 
the  energy  gap  on  the  concentration  in  mct,  but  a  strong  non-linear  variation  in  hzt 
The  electron  mass  as  a  function  of  the  band  gap  is  found  to  be  the  same  in  hct  and 
HZT  in  the  small-gap  region  Our  calculated  sl  bands,  in  fair  agreement  with  the 
most  recent  experiments  and  theories,  support  the  assumption  of  a  large  valence 
band  offset  (350  meV)  between  HgTe  and  CdTe 


1.  Introduction 

Because  of  structural  weakness  in  Hg|.,Ccl,Te  (hct). 
the  Hg:  x2n.,Tc  alloys  (HZT)  and  related  superlattices 
(SL)  may  become  compeiitisc  infrared  materials.  In  this 
paper  we  examine  the  important  parts  of  the  band 
structures  of  hct,  hzt  and  several  SL  systems.  Our  study 
is  based  on  a  second-neighbour  tight-binding  (sntb) 
model,  which  has  been  implemented  for  all  the  systems 
concerned,  including  the  pure  compounds,  the  alloys  and 
the  SL.  The  spin-orbit  interactions  and  the  strain  effects 
in  the  sl  are  also  included  The  alloy  disorder  is  treated 
within  the  molecular  coherent  potential  approximation 
(mcpa)  [1.2].  The  sl  band  structures  are  calculated  using 
a  difference  equation  approach  [3]. 

2.  Pure  compounds  HgTe,  CdTe  and  ZnTe 

The  SNTB  model  uses  four  orthonormal  local  s-  and  p- 
orbitals  per  atom.  The  first-  and  second-neighbour  inter¬ 
actions  are  assumed  to  be  of  the  two-centre  typie.  The  s- 
term  values  c,  are  taken  such  that  the  -e,  are  the 
experimental  first  ionisation  energies  of  the  atoms.  The  p- 
term  values  £,  and  other  interaction  parameters  are 
adjusted  to  fit  the  important  band  quantities  of  the  pure 
compounds.  Figures  l(o)  1(f)  display  the  calculated 
bands  for  the  three  pure  compounds.  In  comparing  the 
present  model  with  the  most  popular  tb  band  model  for 
the  HgTe  CdTe  sl  by  Schulman  and  Chang  (sc)  [4],  we 
found  that  the  bands  around  the  gaps  from  both  models 
are  rather  similar.  The  largest  discrepancy  between  the 
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two  models  is  in  the  valence  band  width.  Our  model  puts 
the  Xj,  level  at  the  experimental  values  fS]  of  -5  and 
-5.5eV  respectively  for  CdTe  and  HgTe.  while  the  cs 
model  sets  it  about  2  eV  deeper.  Since  this  discrepancy 
happens  at  an  energy  far  from  the  fundamental  gap.  it 
should  not  strongly  affect  the  band  structures  of  the  sl 
near  the  gap. 


3.  HCT  and  hzt  alloys 

The  band  parameters  of  the  pure  compounds  are  used  in 
the  alloy  calculation  without  any  further  adjustment.  The 
dominant  disorder  in  hct  and  hzt  comes  from  the  large 
difference  in  the  s-term  values  between  the  cations 
However,  the  fluctuations  in  the  off-diagonal  matrix 
elements  will  also  affect  the  detailed  band  structures.  This 
alloy  disorder  is  treated  within  the  mcpa  as  described  in 
[1.  2]. 

The  major  results  for  the  alloys  are  presented  in  figure 
2,  where  the  band  gap  is  plotted  as  a  function  of  the  alloy 
concentration,  and  in  figure  3,  where  the  conduction 
band  mass  is  plotted  as  a  function  of  the  band  gap.  For 
hct  the  band  gap  £,  deviates  only  slightly  from  the 
straight-line  average  of  the  pure  compound  values, 
However,  hzt  shows  a  significant  bowing  below  the 
average  value.  Figure  3  indicates  that  the  mf-value  for  a 
device  operating  at  the  same  wavelength  in  the  narrow- 
gap  region  is  nearly  the  same  whether  hct  or  hzt  is  used. 
Finally  we  note  that  the  present  calculation  agrees  with 
[2]  in  the  mcpa  self-energy,  indicating  that  the  disorder 
effect  on  the  lifetime  is  important  only  for  states  well 
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Band  structures  ol  HgCdTe  and  HgZnTe 
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Figure  1.  The  calculated  hand  structures  of  (a)  2nTe.  (h)  CdTe  and  (c)  HgTe 
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Figure  2.  The  band  gaps  of  Hg,  _  ,Cd,Te  and  Hg,  _  ,Zn,Te  Figures.  The  electron  masses  ot  Hg,  _  .Cd.Te  and 
as  a  function  of  the  alloy  concentration  *  Hg,  _  ^n.Te  as  a  function  of  the  band  gap 


above  and  below  the  band  edges  in  both  hct  and  hzt,  apparent  from  the  theory  that  the  band  shrinks  as  the 

and  is  not  a  limiting  factor  on  the  mobilities  of  electrons  ratio  n,m  increases,  whereas  the  experimental  data  show 

and  holes  in  the  low-field  transport  in  hct  and  HZT.  an  increase  in  the  gap  going  from  the  (16.  16)  to  (17.  15) 

samples.  For  those  samples  in  which  there  is  a  clear  gap. 
our  calculations  with  A£\  =  350  meV  are  seen  to  agree 
4.  Superlattices  with  experiments  slightly  better  than  the  sc  model  The 

comparisons  for  the  first  three  systems  with  (n.m)  equal 
To  use  the  tb  model  to  treat  the  SL.  we  further  allow  a  to  (24.  9).  (25,  10)  and  (24.  15)  are  less  transparent, 
rigid  shift  of  all  the  term  values  in  a  slab  with  respect  to  because  the  SL  band  structure  has  a  nearly  zero  or 

the  other  to  produce  the  desired  valence  band  offset  A£,.  negative  gap.  While  we  only  list  the  gap  at  A;  =  0,  the 

Table  1  compares  our  calculated  band  gaps  at  At  =  0  for  actual  gaps  may  occur  at  other  At  points.  The  complexity 

the  (001)  SL  HgTe'Cdo  8,Hgo  ,,Te  with  the  experimental  of  the  bands  may  be  illustrated  by  comparing  our  bands 

and  theoretical  values  based  on  the  sc  model  quoted  in  for  the  sample  with  (n,  m)  =  (26. 9)  shown  in  figure  4  with 

[6]  for  several  combinations  of  slab  thickness  repre-  those  in  figure  9  of  [6]  for  the  same  system  using  the  same 

sented  by  the  numbers  of  double  layers  of  atoms  (n,  m)  in  A£,  =  350  meV. 

the  slabs.  First  we  see  a  small  but  consistent  discrepancy  In  figure  4  the  bands  are  plotted  along  k.  (perpendi- 

bctween  the  two  theoretical  models.  The  sc  model  yields  cular  to  the  slab)  and  k,  (parallel  to  the  slabs).  Along  k.. 

a  gap  about  10  30  meV  smaller  than  ours.  The  expcri-  all  the  bands  which  are  derived  from  the  heavy-hole 

mental  data  are  in  belter  agreement  with  the  calculations  bands  of  the  slabs  are  practically  horizontal  lines  owing 
with  a  valence  band  offset  A£v  =  350  meV  between  to  the  large  A£,  used.  The  only  two  bands  with  apprecia- 

HgTe  and  CdTe  than  with  A£,  =  40  meV.  However,  the  ble  dispersion  are  those  derived  from  the  conduction 

agreements  are  not  really  satisfactory.  For  example,  it  is  band  and  the  light-hole  band  The  one  derived  from  the 
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Table  1.  Comparison  between  the  calculated  band  gaps  (in  meV|  in  the  present 
work  with  the  experimental  and  theoretical  values  of  [6]  for  the 
(001)HgTe/Hgu  ,5Cdo6;,''’e  superiattices 


No  ot  layers 
(HgTe/HgCdTe) 

Trveory 

Experi 

ment  [6] 

A£.  =  40  meV 

Present  [6) 

Af ,  =  350  meV 

Present  (6) 

Transport 

Optical 

(24.9) 

33 

5 

-  1 

-30 

<5 

-  16 

(24,10) 

37 

12 

2 

16-26 

<8 

-10 

(24.15) 

65 

51 

16 

1 

<5 

-3 

(19.12) 

89 

68 

43 

25 

35 

41 

(18,14) 

107 

89 

56 

42 

54 

53 

(16,16) 

135 

119 

81 

70 

56 

(17.15) 

121 

104 

68 

55 

80 

64 

(18,18) 

117 

104 

61 

50 

81 

Figure  4.  The  calculated  band  structure  of  (001) 
HgTe/Hgo  i^CdoesTe  superlattice  with  the  numbers  of 
layers  (n.  m)  =  (2o.  9)  and  with  a  valence  band  offset 
-  350  meV  between  HgTe  and  CdTe 


conduction  band  crosses  a  heavy-hole  band  at  k.  =  = 

035n  d  in  our  model  and  at  kj  =  0A2n  d  in  [9],  where  d 
is  the  SL  period.  This  is  therefore  a  zero-gap  system.  The 
light-hole  mass  for  the  band  along  k,  for  k.  =  k^.  as 
shown  by  the  broken  curve,  is  extremely  small,  m*  < 
0.(K)1;  this  was  also  found  in  the  Shubnikov-de  Haas 
experiment  on  the  hole  orbit  of  this  system  by  Seiler  el  al 
[7],  who  also  found  a  sudden  increase  in  the  mass  going 
from  the  low  to  the  high  magnetic  field  They  indicated 
that  the  rapidly  changing  mass  as  a  function  of  k,  can 
explain  the  experimental  result.  The  largest  difference 
between  the  two  calculations  is  the  location  of  the  band 
originating  from  the  slab  conduction  band  at  k  =  0.  Our 
calculation  gives  an  inverted  gap  of  -  12  meV  while  the 
cs  model  gave  a  value  of  about  -40  meV.  We  also  note 
that  in  the  side  bands  (along  kj  with  k.  =  0.  the  top 
valence  band  goes  down  and  then  up  as  k,  increases.  This 
is  due  mainly  to  the  inclusion  of  the  strain  in  the  HgTe 
slabs.  This  intrinsic  complexity  in  the  SL  band  structures 
and  other  extrinsic  complications  such  as  interfacial 
diffusion  and  non-uniformity  of  the  Hg  concentration  arc 
among  the  difficulties  in  pinning  down  the  band  struc¬ 
tures  in  the  small-gap  SL. 
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S.  Conclusions 

The  band  structures  for  the  pure  compounds  HgTe. 
CdTe  and  ZnTe.  their  alloys  and  the  SL  made  from  these 
systems  have  been  studied  using  an  sntb  model.  The 
results  for  the  important  parts  of  the  band  structures  for 
the  pure  compounds  and  alloys  are  in  substantial  agree¬ 
ment  with  our  previous  results  [2]  based  on  a  more 
elaborate  scheme  The  most  important  result  for  the 
alloys  is  that  hzt  is  as  good  an  infrared  material  as  HCr  in 
terms  of  the  electronic  structures  in  the  smail-gap  region. 
Our  results  for  the  sl.  in  fair  agreement  with  the  most 
recent  theoretical  and  experimental  studies,  support  the 
assumption  of  a  large  valence  band  offset  of  350  meV 
between  HgTe  and  CdTe  However,  the  sl  band  struc¬ 
ture  is  less  understood  than  that  in  the  alloys  More 
detailed  comparisons  between  theories  and  experiments 
are  needed  to  improve  our  understanding  of  the  band 
structures  of  small-gap  superiattices. 
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The  band  theory  of  Korringa.  Kohn.  and  Rosioker  tkKR  based  on  the  Green-function  method 
IS  extended  to  space-filling  potentials  A  numerical  lest  using  the  Mathieu  potential  shows  good 
convergence  for  the  bands  up  to  15  Rv  with  /  54  included  in  the  angular-momentum  expansion  for 
the  wave  functions  Our  results  strongly  support  the  applicability  of  the  lUll-poieniial  K.KR  to  bulk 
electronic -St  rue  lure  problems. 


I.  INTRODUCTION 

The  Kornnga,  Kohn,  and  Rostoker  (KKR)  band 
theory'  ■  is  an  elegant  theory  for  the  one-electron  energy 
bands  in  a  closed-packed  crystal  for  which  the  muffin-tm 
(MT)  construction  for  the  potential  is  a  reasonable  ap- 
pro.ximation.  To  expand  the  scope  of  application,  consid¬ 
erable  effort  has  been  expended  to  extend  the  KKR 
theory  to  full  crystal  potentials.'  "  One  concern  about 
such  extension  is  related  to  the  so-called  near-field  correc¬ 
tions  (NFC)  (Refs.  3-5  I  arising  from  the  expansion  of  the 
KKR  Green  function  beyond  the  muffin-tin  region.  Al¬ 
though  there  are  proofs  showing  that  NFC  do  not 
exist,  questions  have  been  raised  about  the  applicability 
of  the  theory.''  Since  space-filling  potentials  are  non- 
spherical  and  the  Wigner-Seitz  cell  boundary  is  not 
smooth,  w  e  are  further  concerned  about  the  speed  of  con¬ 
vergence  in  terms  of  angular-momentum  (/)  expansions. 
In  this  paper  the  integral  equation  approach  of  Kohn  and 
Rostoker'  (KR)  is  used  to  derive  the  full-potential  KKR 
(FP-KKRi  equation  explicitly.  One  advantage  of  our 
derivation  is  that  all  the  quantities  involved  are  functions 
of  r  within  a  unit  cell.  Thus  we  can  avoid  the  uncertainty 
in  extending  the  wave  function  beyond  the  unit  cell  en¬ 
countered  in  some  other  derivations.*’"  We  have  also 
tested  the  convergence  by  comparing  the  numerical  re¬ 
sults  with  the  exact  solution  for  the  Mathieu  poten¬ 
tial''  '■  in  the  simple-cubic  crystal.  Excellent  results  for 
the  band  structure  in  the  energy  range  of  interest  are  ob¬ 
tained  with  a  maximum  value  of  /  =  4  included  in  this  ex¬ 
pansion. 

The  fact  that  the  Mathieu  potential  is  exactly  soluble 
gives  it  an  advantage  for  testing  purposes  over  working 
with  realistic  potentials.'* Our  test  complements  the 
empty-lattice  potential''  '"  to  provide  a  stringent  test  for 
the  FP-KKR  theory.  The  strong  angular-momentum 
dependence  in  the  Mathieu  potential  gives  a  good  repre¬ 
sentation  of  the  anisotropy  that  is  present  in  the  open 
structures  pertaining  to  many  semiconductors  and  insula¬ 
tors.  The  restriction  of  the  KKR  to  closed-packed  met¬ 
als  imposed  by  the  muffin-tm  approximation  is  lifted  by 


the  full-potential  method  discussed  here.  The  results  ob¬ 
tained  here  should  encourage  the  application  of  this 
theory  to  real  crystals 


II.  THE  FULL-POTENTIAL  KKR  EQl  ATION 


In  this  section  we  want  to  shovv  that  the  Kohn- 
Rostoker  integral  equation  can  be  simply  extended  to  ob¬ 
tain  the  full  potential  KKR  theory.  The  Schrodinger 
equation  in  the  band  calculation 

(  -  r’ -t- Ft r  1  =  £t/\i r)  il) 


for  a  full  crystal  potential  Uir)  is  equivalent  to  solving 
the  following  integral  equation:' 

i\(£,ri=J'  C^i £ ;r,r' ) Ui r' it/’^t £.r’ Idr'  ,  )2i 


where  the  integration  is  over  the  Wigner-Seitz  cell  of 
volume  7,  and  k  is  a  crystal  wave  vector.  G^i£;r,r’t  in 
Eq.  (3'  is  the  KKR  free-electron  Greer's  function' 


1  exp[t(  K. +k  )•(  r  -  r' I] 


iK,+k)'-£ 


where  K„  are  the  reciprocal-lattice  vectors.  Alternative¬ 
ly  G^t  £  ;r.r  can  be  expressed  as' 


G^(£;r,r') 


exp'  iK  r-  r’  -  R/  ) 

r-r'-R 


exp(  ik  R,  I  . 


(4i 


where  x  =  V  £  for  £>  0  and  if  =  /  v  £  for£<0,  andR, 
are  the  lattice  translation  vectors.  To  derive  the  FP- 
KKR  equation,  we  first  observe  that  Eq.  (3)  can  be  cast 
into  a  surface  integral, 

[G|j( £;r,r' lV'i/,,( £,r' ) 


-i/'kl£.r'iV'Gki£;r,r')]-dS'  =  0  .  (5i 


w  here  S.  is  the  surface  of  the  Wigner-Seitz  cell. 

Since  the  r’  in  the  surface  integral  exceeds  the 
muffin-tin  radius  .  we  need  to  consider  the  expansion  of 
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the  Green  function  beyond  the  original  range  of  Kohn 
and  Rostoker.  Several  authors  have  already  considered 
this  point.  For  simplicity,  we  shall  only  consider  the  case 
with  one  atom  per  unit  cell.  Since  this  expansion  is  a  cen¬ 


tral  poml  of  coniroversy,  we  rederise  the  result'  explic- 
ly  in  the  Appendix  for  ihe  range  of  r  and  r  needed  heie. 
We  show  that  the  expansion 


Gk!£;r,r'l=  2  2  I'  ^  * | 

L  L  \ 


is  valid  as  long  as  both  r  and  r'  are  inside  r  and  satisfy  the 
following  condition. 

r  <  r' ;<  ,Rj  for  all  ^0  .  (7) 

In  Eq.  (6)  the  notations  j£^(<cr)=j;(Kr IF^tr)  and 
S i(kt)  =  niiKf  \Y i_{t)  are  used,  where  and  n,  are,  re¬ 
spectively,  the  spherical  Bessel  and  Neumann  functions, 
is  a  real  spherical  harmonics,  and  L  represents  the 
double  indices  is  the  usual  KKR  struc¬ 

ture  constant.'"  We  note  that  for  any  r  smaller  than 
r^,  the  condition  in  Eq.  (7)  is  satisfied  for  all  r'  contribut¬ 
ing  to  the  surface  integration  in  Eq.  (5).  The  condition 
■  r'i  <  Rj  in  Eq.  (7)  holds  for  most  lattices;  exceptions 
are  those,  for  example,  with  long  narrow  cells.  For  such 
cases,  this  condition  can  be  satisfied  by  breaking  the  unit 
cell  into  smaller  cells  including  so-called  "empty  cells" 
which  do  not  contain  an  atomic  nucleus. 

The  wave  function  inside  the  cell  r  can  be  expanded  in 
a  basis  set  as 

4\iE,r^= .  (8) 

L 


The  basis  function  <t>^  lE.n  is  a  regular  solution  to  the 
Schrodinger  equation  inside  r. 

[-T--r  U(r)]<I>i(£.ri  =  £4>j£.r)  . 

and  behaves  like  Ji^iKTt  at  the  origin  r  =r„— .0.  which  is 
typically  the  location  of  the  atomic  nucleus.  This  basis 
set  can  be  calculated  using  the  following  integral  equa¬ 
tion:'* 

<t>£( £.ri  =  )  +  2  /  gz.  l£;r-f’  'k  ir'  £.r  Id  ' 

L- 

ilO' 

where  g^(£;r,r’)  is  a  free-particle  Green's  function  and 
is  defined  as 

g^!£;r,r')  =  K'(d^((cr).Vi(KT')  — .V^lxTldiiicr')]  .  Ml' 

We  note  that  the  basis  function  <t>^i£,ri  is  coupled  to 
other  angular-momentum  channels  for  r  >  r^.  because  the 
crystal  potential  Fir)  is  not  spherical. 

The  expansions  of  C  in  Eq.  (6)  and  of  tl'  in  Eq.  <8i  can 
be  substituted  in  Eq.  i5i  to  obtain 


2>/£('‘'r)2  ^  E)  -^kCh^AE)  a;^-(k,£)  =  0,  r  <r„  , 

L  L-  1  L- 


where 

Si  I  \E)  =  k  j ^  (du  ller''  I £,r' i]  dS'  .  (13) 

and 

Cii  -{E)  =  K  [.\7(KT').4)i  .(£.r'']dS'  .  (14) 

In  the  above,  the  notation  [£,.£;]  =  £,V'f; —£-V'£| 
has  been  used.  The  surface  integrals  in  Eqs.  (13)  and  M4) 
2-e  over  the  boundaries  of  r  as  indicated  by  S..  Since 
Jilxri  in  Eq.  M2)  are  linearly  independent  functions,  the 
following  set  of  homogeneous  equations  holds: 

2  [2''''Ar(k.£)S^-i-(£) 

L-  I  L- 

+  kCii-IE)  ai-(k,E)  =  0  .  (15) 

This  IS  the  FP-KKR  equation  that  we  are  after. 


We  note  that  our  derivation  is  similar  to  Nesbet's 
derivation.''  We  hope,  however,  that  the  above  explicit 
derivation  may  be  more  accessible  to  some  readers.  It  is 
also  useful  for  establishing  the  notation  necessary  for  the 
description  of  the  application  of  FP-KKR  theory  to  the 
solution  of  the  Mathieu  potential  which  constitutes  the 
main  result  of  this  paper. 

III.  CALCULATION  OF  5  AND  C  MATRICES 

The  surface  integrals  for  the  S  and  C  matrices  in  Eqs. 

(13)  and  (14)  can  be  very  time  consuming.  It  is  desirable 
to  seek  simplification  of  these  calculations  in  a  real  appli¬ 
cation.  One  plausible  approximation  which  is  consistent 
with  the  KKR  spirit  is  to  expand  every  quantity  involved 
in  angular-momentum  components.  Equations  (13)  and 

(14)  are  equivalent  to  the  volume  integrations 

Si  i{E)  =  K  f  Ji  (Kr)ViT)<t>i(E.T)dT  (16) 
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Q.^(£>  =  -6^  ^.  +  )c  J  Ni^  {Ki)y{T)<Pi^{E,T)dT  .  (17) 

One  can  free  the  limits  of  these  integrals  by  replacing  the 
crystal  potential  Kby  the  truncated  potential  y^, 

I''iri=  rir)a(r)  ,  (18) 


A<I>i(f ;/•>=  2  r'^gi-(£;r.r')AK(r')££(£;r')dr' 

L 

+  2  //gi  (£;r,r')r(r')A<I>£(£;r')</r'  , 

(261 


where  a(  r*  is  a  step  function  and  is  defined  as 

1 1,  for  r  within  r 
|0.  otherwise. 


iteratively  using  angular-momentum  expansions  for  all 
quantities  involved. 

IV.  APPLICATION  TO  MATHIEL  POTENTIAL 


The  angular-momentum  expansion  for  the  basis  function 
is  assumed  to  be 

4>i.(£,r)=  .  (20) 

f 

and  the  truncated  potential  K^is  expanded  as 

I''’(r)=  2  •  (21) 

L 

The  integrations  in  Eqs.  (16)  and  (17)  can  be  reduced,  re¬ 
spectively,  to  the  simple  radial  integrations 

S^-^(£)  =  fc- 2 ;^.(icr  )I^J.  .^..(r)(i^..^(£,r)r‘dr  (22) 

and 

E)  =  —b^  +  K  ^  /,  niAKr)y 

Xdi_-i(E,r)r'dr  ,  (23) 

where  is  the  radius  of  the  circumscribing  sphere  of  the 
Wigner-Seitz  cell.  The  E^^  ir)  is  given  by 

££'^-(r)=  f  Yi’(riy'(T)Yi^  -(T)dn 

=  2  QV"  .  (24) 

L 

where 

Ct  L  =J  yt.(r)Yi(r)y^^..(rldn  (25) 

is  a  Gaunt  coefficient. 

Note  that  in  the  above  the  basis  function  <t>£^(£,r)  is 
assumed  to  be  calculated  from  Eq.  (10).  where  I^(r)  is  the 
full  crystal  potential.  This  is  the  same  procedure  used  by 
Brown  and  Ciftan  (BC)."  The  original  Williams-Morgan^ 
(WM)  approach,  however,  used  the  truncated  potential 
y^  for  the  calculation  of  the  basis  function  in  Eq.  (10).  If 
expansions  of  the  potential  and  <t>i  in  Eq.  (10)  include  all 
the  angular-momentum  components,  both  approaches 

probably  will  give  the  same  results  for  the  band  structure, 
provided  both  converge."  In  practice,  the  expansion  is 
limited  to  a  certain  1^.^^',  therefore,  these  two  approaches 
yield  different  results. 

In  the  actual  calculation  of  the  basis  functions  using  ei¬ 
ther  y  or  y'.  we  first  write  the  potential  as  the  sum  of  y„ 
and  A  I',  where  I  „  is  the  spherical  part  of  the  potential 
and  AT  is  the  rest.  We  then  solve  for  the  radial  wave 
function  fi  corresponding  to  I'q.  Similarly,  the  basis  is 
written  as  <l>,  =  ^  A<t)^  ,  where  =/,  >'^ .  and  A<t>£  is 

solved  from  the  integral  equation 


To  test  the  accuracy  of  the  FP-KKR  equation  and  the 
convergence  in  angular  expansion  described  above,  we 
applied  the  theory  to  the  Mathieu  potential"'  of  the 
form 


y(r)=  -L'o 


2rrx  ,  2jrv  2rz 

cos - cos  — ^  +  cos  - — 

a  a  a 


i2‘’i 


where  we  took  the  lattice  constant  a  to  be  2 rr  times  the 
Bohr  radius  and  the  potential  parameter  L'g  to  be  0.5  Ry 
Because  the  potential  is  separable,  the  eigenvalue  prob¬ 
lem  reduces  to  three  one-dimensional  problems  The 
band  structures  and  corresponding  wave  functions  can  be 
computed  to  the  precision  of  the  computer  and  can  be  re¬ 
garded  as  “exact”  in  the  numerical  comparison. 

The  Mathieu  potential  is  poorly  represented  by  the  MT 
approximation,  because  the  simple  cubic  structure  is 
rather  open  and  the  potential  has  a  large  variation  in  the 


FIG.  1.  Angular-momentum  expansion  of  the  Mathieu  po¬ 
tential  along  [100],  [110],  and  [111]  The  solid  circles  represent 
the  continuous  crystal  potential  I'lrl,  and  the  solid  lines  are  the 
truncated  potential  I’'(rl.  The  dotted  and  the  dashed  lines  arc 
the  sums  of  the  angular-momentum  components  up  to  /  =  8  for 
I'lriand  l"tri.  respectively.  r„,r,  ,andr  are  the  distances  be¬ 
tween  the  origin  and  the  face.  edge,  and  corner  of  the  cube,  re¬ 
spectively  Nonce  that  the  dotted  line  and  the  solid  circles  are 
not  distinguishable  in  the  figure 
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interstitial  region.  For  example,  with  t'o  set  to  be  0.5  Ry, 
the  MT  constant  piotential  is  =  L'u(4/flr(6-jr)] 
^0.501  116  Ry,  while  the  actual  value  of  the  potential 
varies  from  -0.5  Ry  at  (4,0,0)a  to  1.5  Ry  at  (4,4,4)a. 

When  the  full  potential  in  Eq.  (27)  is  expanded  in  cubic 
harmonics  F(r)  =  2i  is  proportional 

to  —  U^jiilrtr /a),  and  the  series  converges  very  fast. 
With  an  —  one  can  achieve  a  converged  F(r),  as 
shown  in  Fig.  1.  However,  in  the  expansion  for  the  trun¬ 
cated  potential,  F’^(r)=  F{r)<7(r)  =  2i  F/(r)A:i(r),  ‘he 
components 

f  fii(r)y(r)cT(r)dr  (28) 

have  to  be  carried  out  numerically  with  great  care.  Be¬ 


cause  of  sharp  edges  and  corners  in  F^ir),  the  angular- 
momentum  expansion  is  only  slowly  converging.  This  is 
evident  in  Fig.  1,  which  shows  sizable  errors  made  in  all 
three  directions  [100],  [110],  and  [111]  in  the  expansion 
of  FUp  to/,^,,  =  8. 

We  have  carried  out  the  FP-KKR  calculation  using 
the  wave-function  expansion  in  Eqs.  (10)  and  (24)  up  to 
The  basis  sets  are  calculated  using  both  the  BC 
and  WM  approaches  with  the  potentials  expanded  up  to 
Results  from  the  MT-KKR  approximation  are 
also  obtained  for  comparison. 

In  Fig.  2(a),  the  solid  lines  represent  the  "exact”  band 
structures  for  the  Mathieu  potential.  The  dots  are  the 
MT-KKR  results.  Despite  the  crude  approximation  in 
the  MT  potential,  the  lowest  band  is  still  reasonable.  The 
MT  approximation  becomes  worse  at  the  higher  energies. 


FIG.  2.  Comparison  of  (a)  the  muffin-tin  KKR  and  (b)  the  FP-KKR  band  structures  (the  dots)  with  the  exact  results  (the  solid 
lines)  for  the  Mathieu  potential.  The  symmetry  points  T,  X,  M,  and  R  correspond  to  the  wave  vector  at  (0,0,0),  (  4 ,0,0),  (  4 ,  i ,  4  ),  and 
(  3,  j ),  respectively,  in  units  of  2tr/o. 
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as  indicated  by  the  large  energy  deviations  and  splittings 
of  the  levels.  For  example,  the  “exact”  bands  from  F  to 
A"  from  r  to  iVf  around  0.9  Ry  are  degenerate  due  to 
separability  of  the  Mathieu  potential,  while  the  MT  ap¬ 
proximation  lifts  this  “accidental"  degeneracy. 

The  full-potential  KKR  results  are  compared  with  the 
"exact”  band  structure  in  Fig.  2(b).  The  dots  are  now  the 
FP-KKR  results  and  are  calculated  based  on  the  BC  ap¬ 
proach.  The  agreement  is  excellent  and  rather  uniform 
up  to  1.3  Ry.  The  calculation  even  preserves  the  acciden¬ 
tal  degeneracy  at  F  at  energy  0.88  Ry.  The  lowest  band 
has  a  detectable  deviation  of  0.016  Ry  at  R,  but  has  very 
small  root-mean-square  (rms)  deviation.  The  deviations 
at  R  and  some  other  energy  states  are  probably  due  to  the 
truncation  in  the  angular-momentum  expansions.  The 
FP-KKR  bands  based  on  the  WM  approach  are  not  no¬ 


ticeably  different  from  those  based  on  the  BC  approach 
plotted  in  Fig.  2(b).  However,  there  are  slight  differences 
between  the  results  of  the  two  approaches.  For  reference, 
we  list  the  deviations  of  both  the  BC  and  WM  ap¬ 
proaches  and  the  “exact”  energies  in  Table  1.  While  the 
WM  approach  gives  a  larger  deviation  in  the  lowest- 
energy  band  around  R,  the  overall  rms  deviations  of  these 
two  approaches  are  similarly  small.  These  results  imply 
some  freedom  in  the  choice  of  basis  functions.  Prov  ided 
that  reasonable  approximations  are  made  in  the  represen¬ 
tation  of  the  cell  potential  and  in  the  calculation  of 
Si^i  and  from  Eqs.  (21)  and  (22),  it  appears  that  the 
FP-KKR  equation  will  give  reasonable  bands  indepen¬ 
dent  of  the  exact  algorithm  for  obtaining  the  e  g., 
from  F(r),  F^(r),  or  other  smooth  potentials  augmented 
to 


FIG.  2.  iContinuedK 
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TABLE  I.  Deviations  of  the  FP-KKR  band  energies  Af 
from  the  exact  values  for  the  Mathieu  potential  at  several 

symmetry  points.  The  subscripts  WM  and  BC  stand,  respective¬ 
ly,  for  the  W'llliams-Morgan  and  Brown-Ciftan  approaches  de¬ 
scribed  in  the  text.  All  energies  are  in  Ry. 


Symmetry 


states 

A£wvi 

^£bc' 

E 

r, 

-0.005.1 

-0.0034 

-0.3414 

r,. 

0.0243 

0.0223 

0.7517 

r, 

-0.0015 

0.0024 

0.8653 

r,: 

0.0034 

0.0069 

0.8653 

0.0274 

0.0160 

-0.0827 

0.0089 

0.0064 

0.4097 

R„ 

0.0156 

0.0145 

0.9020 

Rv 

0.0184 

0.0171 

1.3943 

0.0024 

0.0005 

-0.2551 

X, 

0.0214 

0.0216 

0.2372 

X. 

0.0060 

0.0056 

0.8379 

X, 

-0.0010 

-0.0021 

0.9515 

X. 

-0.0047 

-0.0036 

0.9515 

M, 

0.0016 

0.0002 

-0.1689 

M, 

0.0072 

0.0067 

0.3234 

0.0270 

0.0259 

0.8158 

M, 

0.0033 

0.0037 

0.9241 

0.0058 

-0.0005 

1.0377 

V.  SUMMARY 


functions  expanded  up  to  =4  and  the  potential  up  to 
/ma,  =  8,  the  FP-KKR  theory  as  described  above  gives  ex¬ 
cellent  results  for  the  bands  in  the  energy  range  needed 
for  solid-state  applications.  With  this  method,  one 
should  be  able  to  deal  with  solids  having  open  structures, 
such  as  semiconductors,  for  which  MT-KKR  is  not  suit¬ 
able. 
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APPENDIX: 

GREEN-FUNCTION  EXPANSION 

Here  we  want  to  show  that  Eq.  (6)  is  valid  when  Eq.  (7) 
is  satisfied.  Following  Kohn  and  Rostoker,'  we  separate 
G|(  of  Eq.  (5)  into  two  parts. 


The  main  purpose  of  this  paper  is  to  test  the  accuracy 
of  the  FP-KKR  theory  in  band-structure  calculations. 
To  help  eliminate  doubts  about  this  theory,  we  have  de¬ 
rived  the  FP-KKR  equation  explicitly  from  the  Kohn- 
Rostoker  integral  equation.’  This  FP-KKR  theory  still 
preserves  the  clear  separation  between  the  structural  and 
potential  information  possessed  in  the  MT-KKR  equa¬ 
tion.  The  potential  information  is  contained  in  the  5  and 
C  matrices,  which  can  be  easily  calculated  if  the  basis 
functions  and  potentials  are  expressed  in  angular- 
momentum  expansions.  Such  expansions  are  desirable  in 
a  realistic  calculation.  The  whole  procedure  has  been 
tested  against  the  exactly  soluble  Mathieu  potential  in  the 
simple-cubic  structure.  Because  of  the  openness  of  the 
structure  and  the  high  anisotropy  of  the  potential,  this 
potential  provides  a  challenging  model  to  test  against  any 
band-structure  theory.  Our  results  show  that  with  wave 


Gk(£;r,r')=go(k,£;r,r'H-g,(k,£;r,r')  , 


where  go  the  singular  part. 


go(k,£:r,r')  = 


1  expl/icir  — r'' ) 
4n-  Ir-r'l 


and 

g,(k,£.r,  r') 


J_ 

An 


1 

i  -o 


exp(  lie  !r  —  r'  —  Rj ) 
ir-r'-R,! 


exp(/k  R, )  . 


(Al) 


(A2) 


(A3) 


For  r  <r’  <R^  and  for  r  and  r'  inside  r,  the  first  part  has 
the  expansion  i  (kt' ),  where 

Hi^^Kr)  =  JilKT)  +  iS\{KT).  Under  the  same  condition  for 
r  and  r',  iri  <  ir'-FRj  also  holds  for  a  Wigner-Seitz  cell, 
so  that  the  following  expansion  is  valid:*’  ** 


expliKlr  — r'  —  R,  ’  I 

r-r -R, 


=  -  ik'^J  +  K, )) 

L 


L  L 


where 


)  =  4w2  ' 


(icR; )  , 


(A4) 


(A5) 


where  is  given  in  Eq.  (25).  Therefore  the  Green  function  has  the  expansion  in  Eq.  (6)  with  the  structure  constant 
given  by 

)  I  ■  (A6) 


£^^(k,£)=  —iK 
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Abstract.  Vacancies  are  known  to  play  an  important  role  in  the  Hg-based  narrow- 
gap  alloys  HgCdTe  and  HgZnTe  In  this  paper  we  summarise  our  recent 
calculations  of  the  vacancy  formation  energies  in  HgCdTe  and  HgZnTe  We  find 
that  the  vacancy  formation  energy  in  these  alloys  varies  non-linearly  with  alloy 
concentration,  resulting  in  higher  vacancy  densities  than  those  predicted  on  the 
basis  of  a  linear  variation  of  the  vacancy  energies.  Surface  segregation  in  these 
alloys  is  driven  by  bond  strength  and  bond  length  differences.  We  review  our 
recent  calculations  which  show  that  the  chemical  terms  dominate  in  HgCdTe  and 
result  in  Hg-rich  surfaces,  while  in  HgZnTe  the  strain  terms  contribute  as  well  and 
result  in  a  less  Hg-rich  surface,  suggesting  that  HgZnTe  surfaces  may  be  more 
amenable  to  surface  processes  such  as  passivation. 


1.  Introduction 

The  non-idealities  of  semiconductors  always  control  de¬ 
vice  properties;  this  is  true  to  an  exceptional  degree  in  the 
Hg-based  narrow-gap  alloys  (HgCdTe  and  HgZnTe).  In 
this  paper  we  review  our  recent  work  on  two  of  these 
important  non-idealities,  namely  vacancies  and  surface 
segregation.  Vucandcs  in  HgCdTe  are  responsible  for  the 
intrinsic  doping  in  the  material  and  their  abundance  has 
been  attributed  to  the  weak  HgTe  bond.  Sher  er  a/  ( 1 985) 
have  suggested  that  HgZnTe  may  be  a  better  candidate 
than  HgCdTe  for  infrared  device  applications  on  the 
basis  of  calculations  which  showed  the  HgTe  bond  to  be 
stronger  in  HgZnTe  than  in  HgCdTe,  implying  a  lower 
tendency  for  Hg  vacancy  formation.  Below  we  review  our 
calculations  (Berding  et  al  1 987,  Berding,  Chen  and  Sher, 
unpublished)  of  the  vacancy  formation  energies  in  these 
two  alloys.  The  surfaces  of  these  alloys  play  an  important 
role  in  the  various  device-processing  steps.  Surface  segre¬ 
gation  in  metal  alloys  is  a  well  known  phenomenon. 
Measurements  (Buck  1 982)  have  shown  that  the  compo¬ 
sition  of  the  surface  may  differ  from  that  of  the  bulk,  and 
such  segregation  is  also  expected  to  occur  in  the  semicon¬ 
ductor  alloys.  The  same  forces  that  drive  vacancy  forma¬ 
tion  in  these  alloys  also  drive  surface  segregation,  namely 
the  relative  strengths  of  the  HgTe,  CdTe  and  ZnTe  bonds 
in  the  various  alloy  (and  surface)  environments.  In  this 
paper  we  review  our  recent  work  in  this  area  (Patrick  et 
al  1 989). 
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2.  Vacancies 

The  calculation  of  the  properties  of  vacancies  in  semicon¬ 
ductors  is  a  difficult  problem  to  which  substantial  effort 
has  been  directed  with  various  degrees  of  success.  Most 
of  these  calculations  have  focused  on  the  localised  elec¬ 
tronic  levels.  In  HgCdTe  and  HgZnTe  we  are  interested 
in  the  relative  ease  with  which  the  vacancies  form  in  the 
materials,  and  how  the  vacancy  concentration  varies 
across  the  composition  range.  In  particular,  we  want  to 
know  whether,  for  an  alloy  concentration  corresponding 
to  a  given  band  gap.  vacancies  will  be  more  of  a  problem 
in  HgZnTe  or  in  HgCdTe.  To  answer  this  question,  we 
have  developed  a  tight-binding  model  of  the  vacancies  to 
calculate  the  vacancy  formation  energies  £,  in  the  semi¬ 
conductor  compounds  and  alloys.  This  model  is  sum¬ 
marised  below. 

For  the  compound  semiconductors  a  Green  function 
method  can  be  used  to  treat  the  vacancy  as  a  point  defect 
in  the  otherwise  perfect  lattice.  In  the  alloys  the  lattice  is 
not  perfect  and  is  in  general  disordered  owing  to  the 
presence  of  the  two  atom  species  located  on  the  cation 
sublattice.  Because  of  this  disorder,  an  embedded  cluster 
model  of  the  vacancy  is  preferable  for  the  alloys,  so  that 
the  near-alloy  environment  of  the  vacancy  can  be  mo¬ 
delled  explicitly  while  the  far  environment  can  be  treated 
approximately  (e.g.  in  the  virtual  crystal  approximation 
or  VCA). 

We  have  calculated  £,  for  ZnTe,  CdTe  and  HgTe 
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using  a  tight-binding  model  based  on  Harrison's  (1981) 
parameters  with  the  repulsive  interaction  energy  selected 
to  give  correct  agreement  with  the  experimental  cohesive 
energies  in  the  compounds.  The  final  state  for  the  re¬ 
moved  atom  in  free  space  has  been  used  as  a  reference, 
although  various  other  final  states  can  also  be  calculated. 
Atom  clusters  with  up  to  second-,  third-  and  fourth- 
neighbour  shells  of  atoms  were  used,  and  each  was 
coupled  to  an  extended  bulk  using  perturbation  theory  at 
the  cluster  boundary.  Clusters  are  centred  at  the  vacancy 
formation  site,  and  £,  is  calculated  from  a  difference  in 
total  energies  of  the  cluster  before  and  after  vacancy 
formation. 

The  Green  function  method  was  also  used  to  calcu¬ 
late  total  energies  and  £,  for  the  compounds,  using  the 
same  Hamiltonian.  For  the  cation  vacancy  the  cluster 
calculations  converge  very  quickly  to  the  Green  function 
(effectively  an  infinite-cluster  result).  For  the  anion  va¬ 
cancy  the  convergence  is  not  as  fasi,  an  effect  attributed 
to  the  more  extended  nature  of  the  defect  states  produced 
by  the  anion  vacancy.  The  relative  magnitude  of  the 
anion  or  cation  vacancy  formation  energies  for  most 
compounds  is  not  changed  from  the  cluster  to  the  Green 
function  result,  so  comparisons  among  the  various  com¬ 
pounds  should  not  change.  Thus  we  can  use  the  clusters 
to  calculate  £,  in  the  alloys,  with  a  fixed  correction  across 
the  composition  range  to  account  for  the  cluster  trunca¬ 
tion. 

The  vacancy  formation  energies  in  the  alloys  are 
calculated  in  a  similar  manner,  with  differences  of  cluster 
total  energies  with  and  without  a  vacancy  used  to 
calculate  £,.  The  alloy  environment  for  the  first-  and 
second-nearest  neighbours  is  treated  explicitly,  with  va¬ 
cancy  formation  energies  being  calculated  for  the  various 
particular  arrangements  of  cations.  For  example,  for  the 
Te  vacancy  in  HgCdTe.  £,  for  five  particular  near¬ 
neighbour  environments  of  the  Te  are  calculated: 
HggCdi.  Hg,Cdj,  HgjCd-,  Hg3Cd,  and  Hg^Cdp.  For 
the  cation  vacancy.  £,  for  the  various  cation  arrange- 


2  c  6  a  10  2  <•  6  a  10  12 


Figure  1.  Extraction  energies  in  the  Te  common-anion 
alloys  as  a  function  of  near-neighbour  configuration  for 
bulk  (vca)  concentration  x  =  0.5  The  full  curves  correspond 
to  the  average  extraction  energies  and  the  broken  curves 
to  the  RMS  deviations. 


ments  in  the  second-nearest-neighbour  shell  must  be 
considered.  For  both  cation  and  anion  vacancies  the 
alloy  environment  in  the  third  shell  and  beyond  is 
modelled  in  the  vc.x. 

Results  for  £,  against  the  first-  or  second-nearest- 
neighbour  composition  are  shown  in  figure  I  The  effects 
of  the  alloy  environment  are  most  dramatic  for  the  anion 
vacancy,  as  expected,  because  the  cation  substitution 
occurs  in  the  first-neighbour  shell  about  the  vacancy.  A 
minimum  occurs  in  £,  for  Te  in  HgCdTe  and  HgZnTe 
and  arises  from  the  occupation  of  the  defect  states,  which 
in  this  one-electron  picture  lie  at  higher  energies  in  ZnTe 
and  CdTe  than  in  HgTe.  Thus  in  clusters  containing  at 
least  one  Hg  in  the  first-neighbour  shell  the  defect 
electrons  will  occupy  the  lower-lying  Hg-like  defect  state 

The  effect  of  the  alloy  environment  is  less  for  the 
cation  than  for  the  anion  vacancy.  For  all  cations,  £,  is 
found  to  decrease  as  one  goes  to  Hg-rich  clusters  in 
HgCdTe  and  HgZnTe.  Earlier  calculations  (Sher  et  ul 
1985)  based  on  bond  strength  modifications  in  the  alloys 
predicted  that  Hg  vacancies  will  occur  more  readily  in 
HgCdTe  than  in  HgTe.  in  contrast  to  the  present  predic¬ 
tions.  The  current  results  show  that  the  simple  bulk  bond 
strength  arguments  are  insufficient  to  predict  trends  in 
£,,  because  they  do  not  include  the  effects  of  back-bond 
strength  modifications  due  to  the  vacancy  Thus,  al¬ 
though  the  HgTe  bond  may  be  weakened  in  the  alloy, 
when  an  Hg  vacancy  is  formed,  adjacent  bonds  streng¬ 
then,  modifying  £,  in  [he  alloy. 

The  vacancy  concentrations  in  the  random  alloys 
have  been  calculated  from  the  above  results  and  are 
shown  in  figure  1.  Shown  for  comparison  are  the  vacancy 
concentrations  we  get  if  we  assume  a  linear  variation  of 
£.  with  near-neighbour  environment,  i.e.  a  straight-line 
variation  between  the  end  points  in  figure  1.  For  any 
given  alloy  concentration  .x.  an  appropriate  probability - 
weighted  average  over  the  clusters  is  taken  to  calculate 
the  results  in  figure  2.  From  figure  2  one  sees  that  the 
effects  of  the  non-linearity  of  £,  on  vacancy  concentra¬ 
tions  are  quite  dramatic,  even  for  the  cation  vacancy.  In 
particular,  our  results  predict  higher  concentrations  of 
vacancies  in  the  alloys  than  one  would  expect  on  the 
basis  of  simple  arguments  (i.e.  a  linear  interpolation  of  £, 
from  the  compounds).  We  emphasise  here  the  overall 
behaviour  observed  in  figures  1  and  2.  because  the 
magnitude  of  £,  and  the  vacancy  concentrations  will 
depend  on  the  final  states  available  to  the  atoms  forming 
the  vacancies. 

Before  concluding,  we  note  that  vacancy  formation 
energies  for  final  states  of  the  cations  on  the  (11  DA 
surface  of  the  compound  can  be  estimated  if  one  approxi¬ 
mates  the  (lll)A  localised  states  before  and  after  an  A 
atom  is  added  to  it,  by  similar  defect  states  at  the  anion 
and  cation  vacancy  sites.  A  difference  of  the  vacancy 
formation  energies  for  the  free  atom  final  state  and  the 
(1 1 1)A  surface  final  state  yields  an  estimate  of  the  subli¬ 
mation  energy  of  an  A  atom  from  the  (lll)A  surface. 
These  numbers  are  related  to  the  energies  necessary  to 
calculate  the  surface  segregation  in  HgCdTe  and 
HgZnTe. 
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Figure  2.  Relative  vacancy  concentrations  in  HgCdTe  and 
HgZnTe  The  full  curves  correspond  to  results  using  in 
figure  1 ;  the  broken  curves  are  based  on  a  linear 
interpolation  from  for  the  compounds. 


Lay«r 

Figure  3.  Layer  concentrations  al  the  growth  temperature 
of  973  K  for  HgQgQCdQjo^®  and  f~f9o8s^^o  15^® 


Hg)  to  segregate  to  the  surface.  In  lattice-mismatched 
alloys,  strain  energies  can  also  drive  surface  segregation, 
because  the  removal  of  a  mismatched  atom  to  the  surface 
results  in  a  lowering  of  the  strain  energy  in  the  bulk.  The 
strain  energy  release  would  be  expected  to  be  the  largest 
for  the  dilute  species  segregating  to  the  surface.  In  lattice- 
matched  HgCdTe.  only  the  chemical  energies  contribute 
significantly  to  the  segregation,  while  in  HgZnTe  both 
strain  and  chemical  energies  contribute. 

The  surface  concentration  profile  is  calculated  (Pa¬ 
trick  ei  al  1989)  by  equating  the  chemical  potential 
(defined  as  the  first  derivative  of  the  free  energy  F  with 
respect  to  concentration)  in  each  layer,  because  at  equi¬ 
librium  the  chemical  potential  in  each  layer  must  be 
equal.  The  layers  are  coupled  to  one  another  through  the 
configuration  entropy  contribution  to  F.  Calculations 
were  done  within  the  quasi-chemical  approximation 
(Kumar  et  al  1979,  Kumar  1981)  m  which  pair  interac¬ 
tions  only  are  included,  and  the  regular  solution  model 
(Williams  and  Nason  1974)  in  which  randomness  is 
assumed  within  each  layer 

Surface  segregation  profiles  were  calculated  for  the 
(1 11)A  surface  of  HgCdTe  and  HgZnTe.  Results  corre¬ 
sponding  to  the  equilibrium  for  the  ideal  (1 1 1)A  surfaces 
at  973  K  are  shown  in  figure  3.  Results  shown  are  for 
bulk  concentrations  of  HgogjZnoijTe  and 
Hgo  soCdo  20^^  corresponding  to  the  same  band  gap.  A 
larger  surface  segregation  is  found  in  HgCdTe  than  in 
HgZnTe,  although  for  both  materials  at  this  high  temper¬ 
ature  the  concentration  decays  to  the  bulk  value  within 
four  atom  layers  (a  layer  consisting  of  an  anion  and 
cation  pair  of  atom  planes).  For  both  alloys  the  surfaces 
are  found  to  be  Hg-rich,  as  expected  on  simple  chemical 
bond  strength  arguments.  In  HgZnTe  the  strain  contri¬ 
bution  tends  to  drive  Zn,  the  minority  species,  to  the 
surface  Thus  in  HgCdTe  the  chemical  energies  alone 
drive  Hg  to  the  surfaces,  while  in  HgZnTe  the  chemical 
and  strain  energies  oppose  one  another,  lessening  the 
magnitude  of  the  Hg  surface  concentration  enhancement. 

Calculations  are  in  progress  for  the  surface  segrega¬ 
tion  on  the  (110)  cleavage  plane  and  the  (1(X))  surface, 
which  is  important  in  epitaxial  grow'h.  On  the  basis  of 
bond  strength  arguments  alone,  one  would  expect  the 
surface  segregation  to  be  larger  on  the  (100)  growth 
surface  than  on  the  ( 1 1 1 ) A.  because  of  the  fewer  bonds 
made  by  the  surface  atoms  (the  cations  on  the  (lll)A 
surface  make  three  bonds,  while  on  the  (100)  only  two 
surface  bonds  are  made). 


3.  Surface  segregation 

Two  major  contributions  drive  surface  segregation  in  the 
semiconductor  alloys.  Chemical  energies  arise  from  dif¬ 
ferences  in  bond-breaking  energies  of  the  constituent 
compounds.  This  can  drive  surface  segregation,  because 
fewer  bonds  are  made  by  atoms  at  the  surface  than  in  the 
bulk,  and  one  expects  the  less-weli-bound  species  (e.g. 


4.  Conclusions 

The  above  results  demonstrate  some  of  the  complexities 
encountered  when  dealing  with  the  semiconductor  alloys. 
The  vacancy  formation  energies  are  found  to  exhibit  a 
non-linear  variation  with  alloy  concentration,  resulting 
in  larger  vacancy  concentrations  in  the  alloys  than 
otherwise  expected.  Similar  effects  in  other  Il-VI  and 
III-V  alloys  have  been  predicted  (Berding,  Chen  and 
Sher.  unpublished).  Surface  segregation  has  been  found 
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to  a  greater  extent  for  the  ( 1 1  l)A  surface  in  HgCdTe  than 
in  HgZnTe,  indicating  that  HgZnTe  may  pose  fewer 
problems  in  the  surface-related  properties  and  processing 
steps.  Results  from  calculations  currently  in  progress  on 
the  (110)  and  (100)  surfaces,  and  including  a  vapour 
phase,  will  more  fully  answer  the  question  of  surface 
segregation  on  technologically  important  surfaces. 
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Extraction  energies  for  diamond  and  zinc-blende  semiconductor  compounds  and  pseudobmary 
alloys  are  calculated  using  a  tight-binding  cluster  method,  where  the  final  state  of  the  removed 
atom  is  in  a  free-atom  state.  The  extraction  energies  provide  a  convenient  reference  from  which 
other  final  states  of  the  removed  atoms  can  be  calculated.  In  the  elemental  and  compound 
semiconductors,  the  convergence  of  the  cluster  calculation  was  verified  using  a  Green’s 
function  calculation  with  the  same  Hamiltonian.  For  the  elemental  semiconductors,  vacancy 
(or  Schottky  defect)  formation  energies,  in  which  the  final  state  of  the  removed  atom  is  on  the 
surface,  have  been  calculated.  For  pseudobinary  alloys  of  the  form  ^ ,  B,  C,  we  find 
extraction  energies  to  be  very  sensitive  to  the  local  environment,  exhibiting  a  nonlinear 
variation  between  the  A-  and  5-rich  local  environments;  the  nonlinearity  is  especially 
pronounced  for  the  removal  of  a  C  atom.  Nonlineanties  are  found  to  arise  primarily  from  the 
occupation  of  localized  vacancy  states.  The  impact  that  these  alloy  variations  will  have  on 
measurable  properties  are  discussed. 


I.  INTRODUCTION 

Knowing  the  ease  with  which  vacancies  form  in  semi¬ 
conductor  compounds  and  alloys  is  essential  to  understand¬ 
ing  many  properties  of  these  materials  and  the  way  they  re-" 
spond  to  device  processing.  For  example,  the  deep  levels 
often  associated  with  vacancies  can  be  detrimental  to  device 
performance.  Unfortunately.  limited  experimental  values 
exist  for  the  vacancy  formatior.  energies  in  semiconductors, 
and  little  effort  has  been  directed  toward  understanding  the 
variations  of  the  vacancy  formation  energies  in  the  alloys. 
Interpretation  of  such  experiments  is  complicated  because, 
in  addition  to  the  various  types  of  vacancies  possible  (anion, 
cation,  and,  in  alloys,  particular  environments  for  the  anion 
or  cation ) ,  the  presence  of  other  native  defects,  such  as  inter¬ 
stitials,  antisites,  and  impurities,  also  contribute  to  the  mea¬ 
sured  quantities  such  as  carrier  concentrations  and  diffu¬ 
sion.  Thus  vacancies  are  rarely  truly  isolated  defects  in 
crystals  and  are  influenced  by  the  presence  of  other  defects 
and  the  compound  stoichiometry.  In  addition,  diffusion 
measurements  which  have  been  used  to  deduce  the  vacancy 
formation  energy  are  complicated  by  contributions  from  the 
migration  energy,  as  well  as  by  interstitial  diffusion  and  pos¬ 
sible  diffusion  paths  along  extended  defects,  such  as  disloca¬ 
tions.  Because  of  this,  the  magnitude  of  vacancy  formation 
energies  in  many  semiconductor  compounds  is  not  well  es¬ 
tablished  experimentally. 

Many  efforts  have  been  directed  at  understanding  the 
important  electronic  properties  of  vacancies  and  deep  states 
in  semiconductors.'"’*  Most  of  these  calculations  have  fo- 
cu  •  d  on  the  localized  states  produced  by  the  vacancy  using 
cluster  and  slab  methods.'""  Fewer  models  have  been  used 
to  calculate  formation  in  semiconductors."  '*’ 

In  this  paper  we  present  a  model  for  the  calculation  of 
the  extraction  and  vacancy  formation  energies  in  diamond- 


cubic  and  zinc-blende  semiconductors.  We  use  a  cluster 
method  based  on  Harrison’s  tight-binding  theory.'’  Ham- 
son’s  theory  has  been  found  to  give  semiquantitative  agree¬ 
ment  with  experiment,  yielding  proper  trends  in  structure- 
related  properties  of  pure  semiconductor  compounds  and 
their  alloys.The  elimination  of  surfacelike  states  associated 
with  the  cluster  boundary  have  been  addressed  by  including 
only  complete  bonds  in  the  cluster,  thereby  allowing  atoms 
w'ith  one  or  two  hybrid  orbitals  missing  at  the  cluster  edge, 
and  then  coupling  the  cluster  states  to  bond  orbitals  outside 
ofthe  cluster  by  using  second-order  perturbation  theory.  We 
have  included  in  this  study  ( 1 )  a  verification  of  the  conver¬ 
gence  with  cluster  size  using  a  Green’s  function  method;  ( 2 ) 
an  application  to  both  the  anions  and  cations  in  the  com¬ 
pound  semiconductors;  ( 3 )  a  systematic  comparison  among 
the  various  group-lV,  -ITI-V,  and  -II-VI  semiconductors; 
and  (4)  an  extension  to  the  pseudobinary  alloys  to  study  the 
explicit  effect  of  alloy  composition,  by  directly  considering 
the  various  possible  clusters  of  atoms  about  the  vacancy  site. 
Such  a  detailed  examination  of  the  dependence  of  the  extrac¬ 
tion  energy  on  the  alloy  environment  cannot  be  easily  ob¬ 
tained  in  a  Green’s  function  method  using  an  effective-medi¬ 
um  theory.  Because  of  the  difficulty  associated  with 
correctly  predicting  the  energy  positions  of  the  deep  levels  in 
the  gap,  no  attempt  is  made  to  do  so  here.  Additionally,  only 
the  zero-temperature,  ground-state  energies  have  been  cal¬ 
culated.  Preliminary  results  of  this  work  have  been  pub¬ 
lished  previously."*  ’' 

The  remainder  of  this  paper  is  organized  as  follows.  In 
Sec.  II  we  briefly  discuss  the  method  used  to  calculate  the 
extraction  and  vacancy  formation  energies  in  the  diamond 
and  zinc-blende  semiconductors.  The  extension  of  this 
method  to  the  alloys  is  given  in  Sec.  III.  Results  for  both 
compounds  and  alloys  are  presented  in  Sec.  IV,  and  conclud¬ 
ing  remarks  are  presented  in  Sec.  V. 
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II.  bA I HAU I  lUN  ANU  VACANCY  FORMATION 
ENERGIES  IN  SEMICONDUCTORS 

The  method  we  use  is  a  tight-binding  model,  based  on 
Harrison’s  universal  parameters*'  and  term  values  taken 
from  Chen  and  Sher.*'  We  begin  by  calculating  the  extrac¬ 
tion  energy,  defined  as  the  energy  necessary  to  remove  an 
atom  from  a  bulk  lattice  site  to  a  free-atom  state,  leaving 
behind  an  isolated  bulk  vacancy.  The  vacancy  is  described  by 
using  a  cluster  of  atoms  surrounding  the  vacancy  site.  A  sp' 
hybrid  basis  is  used,  and  atoms  at  the  edge  of  the  cluster  are 
truncated  so  that  no  dangling  bonds  are  included  The  bonds 
at  the  cluster  edge  are  coupled  to  the  infinite  crystal  using 
second-order  perturbation  theory.  Cation-  or  anion-cen¬ 
tered  clusters  are  used  for  the  calculation  of  their  respective 
e.xtraction  energies.  Total  electronic  energies  of  a  cluster, 
with  and  without  the  center  atom  removed,  are  calculated 
from  a  sum  of  one-electron  energies,  obtained  from  the  clus¬ 
ter  diagonalization.  The  extraction  energy  is  then  calculated 
from 

£,=(£,-£,),  (1) 

where  £^  is  the  total  energy  of  the  cluster  with  the  center 
atom  removed  plus  the  removed  atom  in  a  free-atom  state, 
and  £  is  the  total  energy  of  the  cluster  before  the  removal  of 
the  central  atom.  The  cluster  total  energies  include  a  contri¬ 
bution  from  the  hybrid  overlap  interaction  in  each  bond:  this 
overlap  energy  K,,,  is  chosen  for  each  material  so  that  energy 
per  bond  for  the  initial  cluster  agrees  with  experiment.  Con¬ 
vergence  of  the  extraction  energies  with  cluster  size  was  veri¬ 
fied  using  a  Green’s  function  calculation. *■'  in  which  the  va¬ 
cancy  is  simulated  by  setting  the  term  values  at  the  vacancy 
site  to  infinity. 

Several  additional  contributions  to  E ^  were  also  esti¬ 
mated.  First,  the  charge  redistribution  that  occurs  when  an 
atom  is  removed  from  the  cluster  produces  a  shift  in  both  the 
Madelung  energy  C^K  and  the  on-site  Coulomb  energy  AC/. 
The  calculation  of  these  energies  is  discussed  in  Appendix  A. 
Second,  the  atoms  about  the  vacancy  site  may  move  from 
their  ideal  lattice  positions  as  a  result  of  the  formation  of  a 
vacancy,  relaxing  into  the  minimum  energy  configuration. 
The  calculation  of  the  energy  resulting  from  radial  relaxa¬ 
tion  about  the  vacancy  site  is  given  in  Appendix  B.  Finally, 
the  second-neighbor  interaction  of  the  dangling  hybrids  at 
the  vacancy  site  and  the  related  Jahn-Teller  distortion  can 
lower  the  vacancy  formation  energy.  An  estimate  of  this  cor¬ 
rection  is  given  in  Appendix  C. 

Vacancy  formation  or  Schottky  defect  formation  ener¬ 
gies  are  calculated  from,  in  the  elemental  semiconductors, 

£,  =£,  -£,„h.  (2) 

or  in  compound  semiconductors  as 

-I- £'.  )  -  £,.,h.  (3) 

where  is  the  bulk  cohesive  energy  per  atom.  Equation 
( 2 )  corresponds  to  the  removal  of  an  atom  from  a  bulk  site  to 
a  surface,  leaving  a  bulk  vacancy  behind,  and  thereby  in¬ 
creasing  the  number  of  sites  of  the  solid  by  one  while  leaving 
the  nature  of  the  surface  effectively  unchanged.  For  the  com¬ 
pound  semiconductors,  Eq.  (3)  corresponds  to  the  creation 

5065  J  AppI  Phys  .  Vol  68.  No  10,  1 5  November  1990 


of  an  unbound  cation-anion  vacancy  pair  and  the  addition  of 
a  unit  cell  to  the  crystal,  here  normalized  per  atom.  We  can¬ 
not  consider  the  surface  to  be  unchanged  after  the  removal  of 
only  an  anion  or  cation  from  the  bulk  to  the  surface,  because 
in  doing  so  the  surface  stoichiometry,  by  necessity,  will  be 
changed.  Another  way  to  visualize  this  is  to  realize  that  the 
cohesive  energy  appearing  in  Eq.  (2)  can  only  be  defined  per 
cation-anion  pair  in  a  compound,  and  the  cohesive  energy 
per  anion  or  per  cation  alone  cannot  be  generally  defined 
Thus,  in  compound  semiconductors,  additional  informa¬ 
tion,  such  as  stoichiometry  and  other  dominant  defects, 
must  be  supplied  to  calculate  the  vacancy  concentration 
Hence,  while  energies  £““’  =E‘"^‘  '  —  £,.,h  can  be  algebrai¬ 
cally  defined,  they  are  not  generally  physically  meaningful  in 
the  sense  that  they  correspond  to  a  specific  event,  nor  will 
they  alone  in  statistical  equations  predict  measurable  quanti¬ 
ties.  For  similar  reasons,  the  Schottky  defect  formation  ener¬ 
gies  in  the  alloys  are  not  rigorously  defined. 

III.  EXTRACTION  ENERGIES  IN  ALLOYS 

Extraction  energies  in  the  pseudobinary  semiconductor 
alloys  are  calculated  in  a  manner  similar  to  the  elemental 
and  compound  semiconductors.  Beyond  the  cation  and  an¬ 
ion,  there  are  two  distinct  classes  of  vacancies  which  we  must 
consider:  the  removal  of  an  atom  of  the  common  species  and 
the  removal  of  an  atom  of  the  substituted  species.  Below  ,  we 
briefly  discuss  the  calculations  for  these  two  classes. 

We  begin  by  discussing  the  extraction  energy  for  an 
atom  of  the  common  species,  i.e.,  a  C  atom  in  the  pseudobin¬ 
ary  alloy  A,  ,  C,  where  C  can  be  either  a  cation  or  anion 
Consider  the  four  first-neighbor  positions  occupied  by 
{A  —  n)  A  atoms  and  n  B  atoms,  where  n  =  0-4.  Because  of 
the  tetrahedral  symmetry,  there  is  only  one  unique  arrange¬ 
ment  for  the  (4  —  )  /I  and  ft  B  atoms.  Because  the  number 
of  possible  arrangements  of  ,4  and  B  atoms  in  the  third-shell 
neighbor  sites  and  beyond  is  large,  and  because  we  expect  the 
primary  effects  of  alloying  on  the  extraction  energy  to  have  a 
short  range,  we  use  a  virtual  crystal  average  (VCA)  for  the 
medium  beyond  the  second  shell. 

For  an  alloy  of  a  given  concentration  x,  the  average 
bond  lengths  of  ihe  AC  and  £C  bonds  in  the  alloys  have  been 
shown  to  be  well  represented  by*’ 

d,c  =d'\,  +x(d"„c  -d'\c)/4,  (4a) 

and 

dHc=d''„c  -  (1  -x)(d'’c  -t£’c)/4.  (4b) 

where  d"<-  and  d"„^-  are  the  bond  lengths  of  the  constituent 
compounds.  This  is  also  in  excellent  agreement  with  ex¬ 
tended  x-ray  absorption  fine-structure  (EXAFS)  experi¬ 
ment.  Because  in  a  random  alloy  the  deviation  of  the  bond 
lengths  in  various  classes  of  clusters  about  this  value  is 
small,*'’  we  have  used  bond  lengths  calculated  from  Eq.  (4) 
for  all  classes  of  clusters  (i.e.,  values  of  «).  Because  the 
breathing-mode  lattice  relaxation  about  the  vacancy  site  was 
found  to  be  small  for  elemental  and  compound  semiconduc¬ 
tors  (see  Sec.  IV),  lattice  relaxation  about  the  vacancy  in  the 
alloys  has  been  ignored. 

For  a  given  bulk  concentration  x,  the  extraction  energy 
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of  a  C  atom  from  a /<  4  cluster, (jc.n ).  is  calculated 
by  taking  the  difference  between  the  final-  and  initial-state 
cluster  energies  with  the  same  configuration  of  A  and  B 
atoms.  In  a  random  alloy  /< ,  ,  C,  the  average  extraction 

energy  of  a  C  atom  is  given  by 

(JT)  =  ^  £,,£1  (Jt.n).  (5) 

II  -  11 

where  P„  is  the  random  probability  of  a  cluster  with  n  B 
atoms,  given  by 

£„=A,x"(1-x)-  ",  (6) 

and  where  p„  is  the  binomial  coefficient 


The  above  expression  for  £  ^  (at)  can  be  readily  extended  to 
include  nonrandom  cluster  distributions  by  replacing  P„ 
with  the  appropriate  nonrandom  probabilities.  Note  that  it  is 
the  particular  values  of  £,  {x,n )  which  enter  into  the  calcu¬ 
lation  of  many  physical  properties,  not  the  average  calculat¬ 
ed  in  Eq.  (5),  as  shall  be  discussed  below. 

We  now  consider  a  B  vacancy  in  the  alloy  A,  ,  C.  To 
include  specific  alloy  arrangements,  one  must  consider  the 
G„  unique  ways  of  arranging  (12  -  «)  A  atoms  and  n  B 
atoms  in  the  second-nearest-neighbor  positions,  where 
n  =  Q>~\2.  Because  of  couplings  to  other  second-neighbor 
sites  through  the  third-neighbor  atoms,  all  second-neighbor 
sites  are  not  equivalent,  and  in  general  G„  is  large.  Thus  we 
approximate  by  assuming  the  cluster  is  a  Bethe  lattice,  there¬ 
by  decreasing  the  connectivity  of  the  lattice  and  subsequent¬ 
ly  the  number  of  unique  arrangements  to  G  ‘„.  Because  the 
number  of  possible  arrangements  of  A  and  B  atoms  in  the 
fourth-neighbor  sites  and  beyond  is  large,  and  because  the 
detailed  arrangement  of  the  atoms  in  lhe.se  positions  should 
not  significantly  affect  the  extraction  energy,  a  VCA  average 
beyond  the  third  shell  is  used. 

The  extraction  energy,  E  ''{x.n.g,,  }.  for  each  arrange¬ 
ment  g„  of  n  B  atoms  and  {]2  —  n)  A  atoms  in  the  second- 
neighbor  positions  is  computed  by  taking  the  difference 
between  the  initial-  and  final-state  cluster  energies.  The  aver¬ 
age  extraction  energy  for  a  given  concentration  x  is  ex¬ 
pressed  by 

E':(x)  =  2 

„  <1  K.  I 

+  En.,uAx),  (8) 

where  P„  is  the  random  probability  of  a  -'luster  with  n  A 
atoms,  given  by 

P„=p„x"{  \  -X)'-  ",  (9) 

and  p„  is  the  binomial  coefficient 


is  the  probability  of  the  g'„'’  particular  arrangement  of 
(12  —  n)  A  atoms  and  n  B  atoms  in  the  second-neighbor 
sites  of  the  cluster.  Because  it  is  well  known  that  the  proper¬ 
ties  of  solids  are  poorly  represented  by  approximations  such 
as  the  Bethe  tree,  we  do  not  focus  on  the  quantitative  values 


in  presenting  the  results  for  a  vacancy  for  the  alloys,  but 
rather  on  the  overall  trends  observed  correction 
fiuih.  added  to  the  values  calculated  using  the  Bethe 

approximation  to  produce  agreement  with  the  non-Bethe  re¬ 
sults  for  the  compounds  The  correction  in  the  alloy  is  calcu¬ 
lated  by  first  computing  it  for  the  pure  compounds.  AC  and 
£C.  e  g  .  £  =  £,^‘ (zinc  blende)  —  £,■“  (Bethe  lattice), 

using  the  same  cluster  size,  then  interpolating  to  calculate 
the  correction  in  the  alloy,  via  £,K,h,  A)  =  ( 1  —  Jf  )£h^,i„ 
-1- (Jt)£^‘„,^ .  This  correction  is  typically  <0.3  eV.  As 
above,  these  expressions  can  be  readily  extended  to  include 
nonrandom  cluster  distributions. 

IV.  RESULTS 
A.  Extraction  energies 

Extraction  energies  £  for  anions  and  cations  in  the 
group-lV,  -111-V,  and  -II-VI  semiconductors  were  calculat¬ 
ed  for  cluster  sizes  from  13  to  59  atoms  and  compared  with 
the  results  of  a  Green’s  function  calculation  using  the  same 
matrix  elements.  Convergence  to  the  Green’s  function  val¬ 
ues  for  the  cation  vacancies  was  better  than  for  the  anion 
vacancies.  This  was  found  to  be  due  to  the  differences  in  the 
nature  of  the  states  associated  with  the  dangling  hybrids  at 
the  vacancy  site  The  localized  states  at  the  cation  vacancy 
site  arise  primarily  from  the  anion  dangling  hy  brids,  while 
the  states  at  the  anion  vacancy  site  arise  primarily  from  the 
cation  dangling  hybrids.  The  anion  hybrid-derived  states  are 
more  valence  band  like  and  thus  more  localized:  in  contrast 
the  cation  hybrid-derived  states  are  more  conduction  band 
like  and  more  delocalized:  thus  finite  cluster  effects  are  more 
important.  Calculated  extraction  energies  using  the  Green's 
function  are  given  in  Table  1  Also  show  n  for  comparison  are 


TABLE  I  Exiraclion  and  xacancy  formal  ion  energies  for  zine-blende  semi- 
conductors  The  vacancy  formation  energy  for  ihe  compounds  are  normal¬ 
ized  per  atom  Also  shown  for  comparison  arc  ihe  cohesive  energy  per  atom 
in  Ihe  ideal  solids 


Compound 

f...,.  -  cV 

E  -eV 

£  ■  -  eV 

1 

< 

C 

"..'6 

19.2 

19.2 

119 

Si 

4  64 

10.3 

10.3 

5.7 

Ge 

3  88 

8  1 

8  1 

4.2 

Sn 

3.12 

6.3 

6.3 

3  2 

AlP 

4  26 

106 

111 

fc  ^ 

GaP 

3.56 

8  3 

8  8 

5  0 

InP 

3  48 

8.2 

8  8 

5.1 

AlAs 

3  78 

*  8 

0  6 

5  4 

GaAs 

3.26 

6.7 

7  8 

4  0 

InAs 

3.10 

6  8 

7.5 

40 

AlSb 

2.40 

4.7 

6  1 

3  0 

GaSb 

2  96 

5.9 

6.8 

3,4 

InSb 

2  80 

60 

6.3 

3.3 

ZnS 

3.18 

6.9 

10.0 

5.3 

CdS 

2  84 

6  1 

8.8 

4.6 

HgS 

2.04 

4.5 

7.8 

4  1 

ZnSe 

2.58 

5,7 

8.7 

4  6 

CdSe 

2.42 

5,0 

7,8 

4.0 

HgSe 

1  70 

3.4 

6.8 

3.4 

ZnTe 

2  40 

5.1 

8.1 

4  2 

CdTe 

2.20 

4.7 

7,4 

3  8 

HgTe 

1.64 

3.1 

6.4 

3  1 
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the  cohesive  energies  of  the  ideal  solids.  We  see  that  estinutt- 
ed  errors  attributable  to  the  cluster  approximation  are  S  0. 1 
eV  for  the  cation  vacancies  and  S  0.5  eV  for  the  anion  vacan¬ 
cies.  Similar  errors  are  e.xpected  in  the  alloy  cluster  calcula¬ 
tions. 

Although  the  major  contnbui.on  to  E ,  is  from  break¬ 
ing  four  bonds  at  the  vacancy  site,  there  are  additional  terms 
that  contribute  to  E ,  and  that  differ  for  the  cation  and  anion 
vacancies.  These  differences  can  be  attributed  to  several 
sources.  First,  when  an  anion  is  removed  from  the  solid,  the 
electrons  on  the  removed  anion  are  depromoted  from  the5/»' 
hybrids,  while  the  electrons  on  the  four  cations  adjacent  to 
vacancy  remain  in  e.xcited  hybrid  states,  when  a  cation  is 
removed,  it  is  the  cation  electrons  that  are  depromoted  F  om 
the  hybrids,  while  the  anion  orbitals  remain  hybridized.  Be¬ 
cause  the  bond-breaking  term  2£^„,,  is  referenced  to  the  free 
cation  and  anion,  the  energy  of  the  electrons  which  remain 
promoted  to  sp'  hybrids  must  be  included  explicitly.  This 
contribution  to  is  larger  for  the  cation  vacancy  because 
both  the  number  of  electrons  and  the  promotion  energy  per 
electron  is  greater  for  the  four  anions  at  the  vacancy  site.  A 
second  difference  in  £ ,  arises  from  metallization-induced” 
shifts  in  the  dangling  hybrid  energies.  This  shift  is  upward 
for  the  anion  hybrids  and  downward  for  the  cation  hybrids, 
yielding  net  positive  (negative)  contribution  to  £,  for  the 
cation  (anion)  vacancy.  Finally,  a  third  difference  in  £^ 
arises  from  back-btmd  energy  shifts  which  result  from  the 
cation  (anion)  dangling  hybrids  at  the  anion  (cation)  va¬ 
cancy  site  coupling  to  adjacent  unbroken  bonds  and  anti¬ 
bonds  in  the  lattice.  Resulting  shifts  from  this  source  are 
large  and  negative  for  the  cation  vacancy  because  of  the  larg¬ 
er  magnitude  of  the  coupling  F, ,  and  the  deeper  energy  and 
resulting  stronger  coupling  of  the  anion  dangling  hybrid 
state,  with  respect  to  the  cation  states  at  the  anion  vacancy 
site. 

The  contribution  from  relaxation  and  Coulomb  energies 
to  both  anion  and  cation  extraction  energies  are  comparable 
and  small.  The  nearest-neighbor  atoms  are  found  to  relax 
away  from  the  vacancy  site  when  a  cation  is  removed,  while 
they  relax  toward  the  vacancy  site  when  an  anion  vacancy  is 
created.  For  both  cation  and  anion  vacancies,  calculated 
bond-length  shifts  are  small,  with  the  first-neighbor  bond 
lengths  shifting  by  less  than  2%.  The  breathing-mode  relax¬ 
ation  at  a  vacancy  site  has  often  been  assumed  to  be  similar  to 
the  relaxation  occurring  at  a  (111)  surface’’  where  the 
atoms  are  found  to  move  toward  the  bulk,  shortening  the 
bond  lengths  of  nearest-neighbor  bonds  by  8%-15%.  In 
contrast  to  some  previous  work,  we  find  that  the  bond- 
length  and  energy  shifts  in  the  relaxed  cluster  are  small  com¬ 
pared  to  the  accuracy  of  the  model;  inclusion  of  relaxation  in 
the  calculation  of  £,  and  £,  is  not  important.  The  Jahn- 
Teller  energy  £y,  resulting  from  the  direct  interaction  of  the 
dangling  hybrids  has  been  estimated  (see  Appendix  C),  and 
energies  of  the  order  of  —  0.6  eV  are  obtained. 

B.  Schottky  defects 

From  the  extraction  energies  and  Eqs.  (2)  and  (3) 
above,  Schottky  defect  formation  energies  were  calculated 
and  are  given  in  Table  I.  As  discussed  above,  in  the  com¬ 


pounds  the  Schottky  defect  formation  energy  £,.  is  physical¬ 
ly  meaningful  only  for  a  cation-anion  (bound  or  unbound) 
vacancy  pair,  since  the  creation  of  a  new  unit  cell  by  bringing 
atoms  to  the  surface  requires  creation  of  two  lattice  sites.  A 
comparison  of  £,  with  the  cohesive  energy  is  shown  in  Fig.  1. 
We  find,  as  expected,  that  the  £,  increases  with  increasing 
cohesive  energy,  and  that  the  group-IV,  -IIl-V,  and  -Il-Vl 
compounds  fall  into  groups  that  each  vary  nearly  linearly 
with  the  cohesive  energy,  but  that  have  different  slopes  and 
initial  values.  Also,  for  a  given  cohesive  energy,  £,  is  larger 
for  the  more  ionic  matenals  Compensation  effects,  which 
are  expected  to  be  largest  in  the  wide-gap  ionic  materials,  are 
not  included  in  this  calculation.  Observed  trends  may  be 
modified  by  the  inclusion  of  this  effect.  Also  shown  in  this 
figure  IS  the  quantity  £‘'J‘’  —  £  „h  for  the  compound  semi¬ 
conductors,  to  illustrate  the  variation  of  extraction  energy 
with  cohesive  energy  The  difference  in  this  energy  between 
the  anion  and  cation  is  larger  for  the  more  ionic,  group-ll- VI 
compound,  as  expected.  We  also  find  £‘^  <£  *  .  consistent 
with  the  observations  of  measured  extraction  energies  in 
CdS  and  CdTe.^" 

The  vacancy  concentration  can  be  found  by  minimizing 
the  change  in  the  Gibb’s  free  energy  AG  between  the  pure 
bulk  crystal  and  the  crystal  containing  n  vacancies.  For  non¬ 
interacting  vacancies,  the  Gibbs  free  energy  is  given  by 

^G  =  nE,-TS . ""(«)-(-£(«.£).  (ID 

where  T  is  the  temperature  and  is  the  configura¬ 

tional  entropy  of  the  n  vacancies.  F{n,T)  accounts  for  free- 
energy  changes  from  the  modification  of  the  phonon  fre- 


FIG  1  Comparison  of  the  Schottky  defect  formation  energy  (normalized 
per  atom )  and  the  cohesive  energy.  Squares  are  the  group-IV  elements,  sol¬ 
id  circles  are  group-IIt-V  compounds,  and  open  circles  are  group-Il-VI 
compounds.  Dashed  lines  represent  the  least-squares  fit  through  each  senes 
of  compounds,  where,  for  the  group-IV  elements,  carbon  ( not  shown )  has 
been  included  in  the  fit  Upper  (lower)  X  s  represent  the  quantity 
—  £■  ,,,  for  a  given  compound 
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quencies  because  of  the  presence  of  n  vacancies,  as  well  as 
additional  free-energy  changes,  such  as  entropy  associated 
with  the  Jahn-Teller  distortion  at  the  vacancy  sites  The  case 
most  often  treated  in  textbooks"  '■  is  where  the  number  of 
surface  sites  is  ignored  in  the  counting  of  the  entropy.  The 
number  of  vacancies  n  is  given  by"  as 

n  =  riNe  (12) 

where 


exp 


1  dF(n.T) 


kT 


dn 


)■ 


(13) 


and  n  is  the  number  of  sites  for  the  vacancy.  If  the  change  in 
the  surface  is  included  explicitly  in  the  calculation  of  the 
Schottky  defect  formation  energy,  one  must  include  the 
complete  information  about  the  surface.  Simply  including 
the  entropy  of  the  vacancy  atoms  removed  to  the  surface  in 
the  Bragg-Williams  approximation,  assuming  a  perfectly 
flat  surface  when  there  are  zero  vacancies  (see,  for  example, 
Landsberg  and  Canagaratna),”  is  not  sufficient,  A  full  ac¬ 
count  of  the  nature  of  the  surface,  to  include  the  atom/sur¬ 
face-vacancy  distributions,"  the  appropriate  entropy,  mul¬ 
tiple  surface  layers,  and  even  surface  reconstruction,  is  nec¬ 
essary  so  that  the  true  surface-bulk  equilibrium  can  be  de¬ 
scribed.  Because  the  problem  can  be  decoupled  into  two 
equikbria.  i.e.,  the  surface  with  the  ideal  bulk  and  the  vacan¬ 
cies  with  the  ideal  bulk,  the  equilibrium  vacancy  concentra¬ 
tion  can  be  expressed  by  Eq.  (12)  in  most  cases. 

Because  of  the  difficulty  of  measuring  the  vacancy  for¬ 
mation  energy,  only  a  limited  number  of  values  exists  in  the 
literature.  Even  for  silicon,  the  most  studied  of  the  semicon¬ 
ductors,  the  value  for  the  vacancy  formation  enthalpy  is  not 
firmly  established,  because  of  the  complication  of  diffusion 
experiments  by  the  possibility  of  multiple  simultaneous  dif¬ 
fusion  processes  such  as  vacancies  and  interstitials.  High- 
temperature  diffusion  in  Si  is  found  to  follow  a  typical  Arr¬ 
henius  behavior,  with  measured  activation  energy  of  4-6 
eV.''  The  activation  energy  in  diffusion  £■„,  is  given  by  the 
sum  of  the  vacancy  formation  energy  E,  and  the  migration 
energy  Migration  energies  measured  by  Watkins''’  are 
of  the  order  of  0.25  eV.  Combining  our  vacancy  formation 
energy  with  the  measured  migration  energies,  we  find  E„  s  6 
eV,  consistent  with  the  measured  activation  energies.  Our 
results  are  also  in  excellent  agreement  with  local  density, 
pseudopotential,  Green’s  function  calculations  of  Car  et 
al.'*  Note  that  both  the  present  results  and  those  of  Car  et  ai 
are  approximately  twice  the  value  of  E,  from  earlier  esti¬ 
mates. 

The  extraction  energies  are  more  difficult  to  measure  in 
the  semiconductor  compounds  because  of  the  presence  of 
two  classes  of  defects  on  the  cation  and  anion  sublattice.  In 
the  annealing  experiments  typically  done,  assumptions 
about  which  native  defects  are  dominant  must  be  inferred 
from  the  experiments,  and  additionally,  much  of  the  analysis 
must  employ  values  of  the  donor  or  acceptor  levels  of  the 
vacancies,  which  are  in  themselves  not  conclusively  identi¬ 
fied  in  most  compounds.  The  extraction  energies  in  a  num¬ 
ber  of  II-Vl  compounds  have  been  deduced  from  annealing 
experiments,  characterized  by  Hall  measurements.  Ray  and 
Kroger'"  have  measured  the  Zn  extraction  energy  for  ZnSe 


as  E^“  —  4  6  eV.  which  compares  to  our  calculated  value  of 
E.^  =  5.7  eV.  In  CdS,  Kumar  and  Kroger"'  have  measured 
E\'‘  =  4.0 eV'  and  E\  =7.3  eV  compared  to  our  calculated 
values  of  E\'^  =  6, 1  eV  and  E\  =8.8  eV.  In  CdTe,  Chern. 
Vydyanath,  and  Kroger’"  have  measured  E\^  =  4  7  eV  and 

=  6.0  eV  compared  to  our  calculated  values  of 
E\'‘  =  4.7  eV  and  £  "  =  7.4  eV  Finally,  in  Hg,  ,Cd,Te 
(jc  =  0.2),  Vydyanath'"  measures  £"‘^  =  2.2  eV,  while  we 
find  £“‘^  =  1.6  eV.  Note  that  in  general  our  calculated  val¬ 
ues  are  larger  than  the  measured  values,  although  for  the 
cation  vacancy  in  CdTe  our  agreement  with  the  measured 
values  IS  quite  good.  We  do  find  that  £""""  >  E'"^'""',  consis¬ 
tent  w  ith  experiment  Because  of  possible  contribution  from 
compensation  effects,  for  which  one  must  know  about  the 
localized  levels  in  the  gap,  care  must  be  exercised  w  hen  com¬ 
paring  our  results  directly  with  the  measured  Schottky  de¬ 
fect  formation  energies,  in  order  to  ensure  that  the  experi¬ 
mental  situation  approximates  the  initial  and  final  states 
treated  in  our  calculations. 

Baraff  and  Schliiter"  have  calculated  the  total  energies 
for  the  cation  and  anion  vacancies  in  GaAs  using  a  pseudo¬ 
potential  Green's  function  calculation  within  the  local  den¬ 
sity  approximation.  We  compare  our  results  with  their  value 
for  the  Schottky  defect  energy,  £"'*'  =  3.35  eV'/atom. 
where  both  vacancies  are  neutral.^"  The  corresponding  val¬ 
ues  from  the  present  work  is  £p‘*'  =  4.0  eV.  Jansen  and 
Sankey"’  have  also  calculated  the  vacancy  formation  ener¬ 
gies  for  several  compounds  using  an  ab  initio  pseudo-atomic- 
orbital  method.  Our  calculated  energies  are  in  good  agree¬ 
ment  with  their  values.  They  calculate  formation  energies  of 
4.8,  4.25,  4.3,  and  3.7  eV  in  GaP,  GaAs,  ZnSe,  and  ZnTe, 
respectively.  These  compare  to  our  values  of  5.0.  4.0,  4.6, 
and  4.2  eV  for  the  same  compounds.  As  in  the  comparison 
with  experiment,  we  find  that  the  present  tight-binding  mod¬ 
el  in  general  yields  energy  values  which  are  higher  than  the 
ab  initio  values. 

C.  II-Vl  pseudobinary  alloys 

While  the  cluster  approximation  is  expected  to  intro¬ 
duce  some  error  in  the  calculated  extraction  energies,  as  not¬ 
ed  above,  the  results  presented  here  emphasize  the  relative 
energies  of  different  local  clusters  in  the  alloys,  and  hence  the 
cluster  -'alculation  should  not  change  our  major  conclu¬ 
sions.  Corrections  from  Jahn-Teller  distortion  ^.nd  Cou¬ 
lomb  energies  have  not  been  included  in  this  alloy  study. 

1.  Common  anion  alloys 

We  begin  by  examining  in  detail  the  extraction  energies 
in  one  II-VI  common  anion  alloy.  The  Te  common  anion 
alloy  system  has  been  chosen  as  an  example  because  of  the 
technological  importance  of  these  alloys.  HgCdTe  and 
HgZnTe  are  both  narrow-gap  alloys  in  the  Hg-rich  concen¬ 
trations  with  infrared  device  applications,  and  CdZnTe  is 
often  employed  as  a  substrate  material  for  HgCdTe  and 
HgZnTe.  Additionally,  the  alloy  variation  of  £^  in  this  sys¬ 
tem  is  representative  of  the  behavior  found  in  other  alloy 
systems. 

Cation  and  anion  extraction  energies  in  the  Te  common 
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aniun  auuyb  j«-Uo,  le,  v^Uo,ngo,  ic,  ana  Hgu,i:.no.5  le 
are  shown  in  Fig.  2  as  a  function  of  the  near-neighbor  com¬ 
position.  These  and  the  following  alloy  results  are  for  clus¬ 
ters  with  53  atoms.  For  the  Te  anion  vacancy,  the  five  near¬ 
neighbor  arrangements  are  individually  unique  and 
correspond  to  the  values  shown  in  Fig.  2.  The  values  for  the 
Zn,  Cd,  and  Hg  cation  extraction  energies  in  Fig.  2  corre¬ 
spond  to  an  average  over  the  unique  arrangements  of  atoms 
in  the  second-neighbor  positions,  as  shown  explicitly  in  Fig. 
3.  The  random-probability-weighted  average  extraction  en¬ 
ergy  as  a  function  of  composition  x  is  shown  as  the  solid  line 
in  Fig  4,  with  the  rms  deviations  shown  as  dashed  lines. 

First,  we  discuss  the  anion  extraction  energies  in  the 
CdHgTe  alloy,  shown  in  the  center  panel  in  Fig.  2.  In 
CdHgTe,  the  dependence  of  the  Te  extraction  energy  on  n, 
the  number  of  Hg  atoms  in  the  first-neighbor  sites,  is  easily 
understood  within  the  context  of  Harrison’s  bond-orbital 
model.'’  Although  results  are  interpreted  using  bond-orbi¬ 
tal  model  concepts,  the  results  presented  are  those  from  a  full 
cluster  calculation.  There  are  two  primary  contributions  to 
the  extraction  energy  which  dominate  the  overall  depend¬ 
ence  of  on  n.  The  first  contribution  to  accounts  for  a 
gradual  increase  in  £  ,  in  going  from  the  Cd  |Hg,  totheHg^ 
cluster  and  is  due  primarily  to  the  differences  in  the  bonding 
energy  e„  ( Ref  1 7 )  of  the  four  bonds  which  are  broken  upon 
vacancy  formation.  Since  the  bonding  energy  level  of  the 
HgTe  bond  is  lower  than  for  the  CdTe  bond,  i.e., 
because  of  the  deeper  s  and  p  states  on  the  Hg 
compared  to  Cd,  this  contribution  to  the  £  ^  will  result  in  an 
increase  with  an  increase  in  the  number  of  Hg  cations  adja¬ 
cent  to  the  vacancy  site.  The  breaking  of  the  four  bonds  also 
results  in  regaining  the  repulsive  bond  energy  F„,  which  is 
smaller  for  the  slightly  shorter  bond-length  compound 
CdTe.  This  contnbution  to  £..  will  produce  a  decrease  in 
£,  w  ith  the  increasing  number  of  HgTe  bonds  broken,  but 
the  magnitude  is  small  because  of  the  nearly  equal  lengths  of 
the  CdTe  and  HgTe  bonds.  The  net  result  of  both  these 
bond-breaking  contributions  to  £^  is  to  produce  an  increase 
in  £,  with  an  increasing  number  of  Hg  atoms  adjacent  to 
the  vacancy,  as  seen  in  Fig,  2.  The  second  contribution  to£, 
that  dominates  the  n  dependence  is  related  to  the  energy  of 


FIG  i  Cation  cMraction  energy  in  Te  common  anion  alloys  plotted  for  the 
unique  configurations  of  second-nearesi-neighbor  cations,  for  bulk  concen. 
iraiion  x  ~  0  5  The  open  circles  correspond  to  the  extraction  energies  of 
unique  configurations;  the  filled  circles  correspond  to  the  probability- 
weighted  average 

the  electrons  in  the  dangling  hybrid  levels.  When  a  Te  atom 
is  removed  from  CdHgTe,  the  six  outer-shell  anion  electrons 
are  removed  with  the  neutral  Te  atom,  to  leave  two  electrons 
in  the  four  cation  dangling  hybrids.  The  energy  levels  that 
these  electrons  occupy  depend  on  the  cation  species  which 
are  adjacent  to  the  vacancy  site.  The  atomic  hybrid  ener¬ 
gy  of  Hg  is  lower  than  that  of  Cd,  £!;'*'<  (f  is  used 
throughout  to  refer  to  the  free  atomic  values,  and  e  is  used  to 
refer  to  the  calculated  cluster  values).  As  expected,  a  similar 
relationship  is  found  between  the  calculated  dangling  hybrid 
energies  at  the  vacancy  site,  where  we  have  ej,'*'  <  Thus,  if 
at  least  one  Hg  atom  is  adjacent  to  the  vacancy  site,  the  two 
electrons  will  occupy  the  lower-energy  Hg  dangling  hybrid 
level,  and  their  contribution  to  the  extraction  energy  will  be 
approximately  constant.  For  the  Cdj  cluster,  the  two  elec¬ 
trons  must  occupy  the  higher-energy  Cd  dangling  hybrid 
orbitals;  this  accounts  for  the  relatively  large  jump  in  the 
extraction  energy  in  going  from  the  Cd,  to  Cd,  Hg,  cluster 
in  CdHgTe,  as  seen  in  Fig.  2. 

A  similar  interpretation  follows  for  the  anion  extraction 
energy  in  HgZnTe,  shown  in  the  third  panel  of  Fig.  2.  For  the 
HgZnTe  system,  we  have  ef"  >  e”*,  thus  the  two  electrons 
which  occupy  the  dangling  hybrids  will  occupy  the  lower- 
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FIG.  2.  Extraction  energies  in  the  Te  common  anion  alloys  as  a  function  of 
near-neighbor  configuration  for  bulk  concentration  x  =  0.5.  In  Figs.  2  and 
3- 14.  the  solid  line  corresponds  to  ihc  average  exlraclion  energies  and  the 
dashed  lines  to  the  rms  deviations 


FIG.  4.  Exlraclion  energies  in  the  Te  common  anion  alloys  as  a  function  of 
composition  x. 
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lying  Hg  levels.  Forthe  Hgj.HgiZn,  Hg.Zn,.and  Hg|Zn, 
clusters,  this  contribution  to  is  nearly  constant  and  it 
abruptly  increases  for  the  Znj  cluster,  as  is  observed  in  Fig 
2  The  other  primary  contributions  to  £,  come  from  the 
bonding  energy  and  repulsive  energy  of  the  bond  breaking 
These  two  contributions  produce  opposite  slopes  of  compar¬ 
able  magnitude  between  the  Hgj  and  Zn4  clusters,  resulting 
in  £,  {x,n)  nearly  independent  of  n.  Finally,  for  the  Te  va¬ 
cancy  in  ZnCdTe,  shown  in  the  first  panel  of  Fig.  2,  the 
dependence  of  the  contribution  to  from  the  electrons  in 
the  dangling  hybrids  on  the  number  of  Zn  and  Cd  atoms 
adjacent  to  the  vacancy  site  is  not  as  important  as  in  CdHgTe 
and  HgZnTe,  because  the  difference  in  the  dangling  hybrid 
energies  ef"  and  Ca‘‘.  is  small.  For  the  bonding  energy  contri¬ 
bution  to  E^  ,  we  find  and  >  re¬ 

sulting  in  a  net  decrease  in  E,^  with  an  increasing  number  of 
CdTe  bonds. 

As  noted  above,  the  cation  extraction  energies  in  Fig.  2 
represent  an  average  over  the  unique  configuration  of 
cations  in  the  second-neighbor  shell  E^  for  the  unique  sec¬ 
ond-neighbor  cation  configurations  are  shown  in  Fig.  3.  The 
regular  structure  observed  in  Fig.  3  is  in  part  due  to  the  use  of 
a  Bethe  lattice  and  the  additional  degeneracies  it  superposes 
on  the  lattice.  The  variations  in  £,  arises  from  shifts  in  the 
energy  of  the  four  broken  bonds  and  the  vacancy  localized 
states  through  interaction  with  the  second-neighbor  alloy 
bonds.  For  a  true  zinc-blende  lattice,  we  expect  a  smaller 
dispersion  about  the  average  value  shown  in  Fig.  3. 

The  overall  dependence  of  the  anion  extraction  energy 
on  X,  shown  in  Fig.  4,  follows  the  same  general  trends  as  in 
Fig.  2.  In  CdHgTe  and  HgZnTe,  the  downward  bowing  of 
£^  (/I)  results  in  a  downward  bowing  in  E ,  (x).  Also,  the 
comparatively  large  variation  of  the  Te  extraction  energy 
relative  to  changes  in  the  near-neighbor  environment  results 
in  a  large  rms  deviation,  particularly  for  the  Cd-rich  compo¬ 
sitions  of  CdHgTe  and  Zn-rich  compositions  of  HgZnTe. 
For  the  cation  vacancy,  the  variation  of  the  extraction  ener¬ 
gy  with  cluster  composition  is  of  smaller  magnitude  than  for 
the  anion  vacancy,  thus  resulting  in  a  relatively  small  rms 
deviation,  although  the  downward  bowing  of  the  cation  va¬ 
cancy  plot  as  a  function  of  n  results  in  a  small  downward 
bowing  in  the  Jr-dependent  function. 

For  the  anion  extraction  energy  in  all  of  the  common 
anion  II-VI  •>.lloys,  the  break  from  linearity  of  the  extraction 
energy  versus  cluster  composition  n  always  occurs  at  the 
«  =  1  or  3  cluster,  as  was  observed  for  the  Te  system  in  Fig. 
2.  This  is  because  the  two  electrons  which  occupy  the  dan¬ 
gling  hybrids  will  always  occupy  the  lowest  available  hybrid 
level.  For  an  alloy  „B„  C,  where  e  ',  >  e^,  these  two  elec¬ 
trons  will  occupy  a  B  dangling  hybrid,  except  in  the  n  =  0 
cluster  where  no  B  atoms  are  present,  resulting  in  an  abrupt 
increase  in  £,  .  Because  this  break  from  linearity  is  mostly 
related  to  the  cation  hybrid  energy  only,  one  notes  that  if  the 
break  from  linearity  occurs,  for  example,  at  n  =  1  for 
A^_  „B„C,  it  will  also  occur  at  «  =  1  for  all  A^  „B„D  al¬ 
loys,  as  is  observed  by  comparing  Figs.  2, 5,  and  6.  This  break 
in  linearity  of  the  curves  can  be  predicted,  since  f"*'  <  f**', 
f*'  <  and  Additionally,  the  sign  of  the  slopes 

can  be  predicted  within  the  simple  bond-orbital  model  by 


comparing  the  major  alloy-dependent  contribution.  E^, 
=  8f,,  +  4F„,  for  the  different  systems.  The  bond-orbiial 
model  predicts  £  >  E  >  £h,‘'^  for  X  =  S,  Se.  and  Te 

This  is  in  agreement  with  the  sign  of  the  slopes  calculated 
within  the  cluster  calculation  and  shown  in  Figs.  2,  5,  and  6. 
Although  the  bond-orbital  model  is  sufficient  to  predict  the 
general  trends  observed,  the  magnitudes  of  the  slopes  must 
be  determined  from  the  cluster  calculation,  which  includes, 
for  example,  the  back-branch  metallization  effects  not  calcu¬ 
lated  in  the  simple  bond-orbital  analysis  above. 

As  was  found  for  the  Te  common  anion  alloy,  the  aver¬ 
age  cation  extraction  energies  in  other  Il-Vl  alloys  \ary 
nearly  linearly  with  the  cation  concentration  in  the  second- 
nearest-neighbor  shell.  The  cation  extraction  energies  in  the 
Il-VI  alloys  vary,  depending  on  the  particular  arrangement 
of  atoms  in  the  second-shell  sites,  as  was  shown  for  Te  in  Fig. 
3.  Although  these  values  for  each  alloy  system  are  not 
shown,  a  measure  of  the  variation  is  shown  in  the  rms  devia¬ 
tions  in  the  plots  of  £^  (n). 

2.  Common  cation  alloys 

The  structure  in  the  cation  extraction  energy  versus  n 
curves  for  the  Il-Vl  common  cation  alloys.  Figs.  7-9,  can  be 
interpreted  in  a  manner  similar  to  the  anion  extraction  in  the 
11-Vl  common  anion  alloys  From  the  bond-orbital  model, 
we  find  £h,'‘>£h,^‘  >£h,''  for  X  =  Zn.  Cd.  and  Hg  Thus 
we  expect  this  contribution  to  E ^  will  increase  linearly  for 
increasing  number  of  S  atoms  in  the  XSj  „Se„  and 
XTe^  „S„  clusters,  and  will  increase  linearly  with  increas¬ 
ing  Se  atoms  in  XSej  „Te„  clusters.  The  second  contribu¬ 
tion  that  dominates  the  n-dependent  behavior  of  £ ,  is  that 
of  the  energy  of  the  six  electrons  in  the  dangling  hybrids. 
From  the  atomic  term  values,  we  find  e]'  >  >  ej, ;  thus  we 

would  predict  that  the  electrons  will  occupy,  in  order  of  pre¬ 
ference  (lowest  energy  first)  S.  Sc.  then  Te  hybrids,  as  is 
observed  from  the  results  of  the  cluster  calculations.  For 
example,  £^  for  the  cluster  XSj  „Se„  can  be  expressed  as 

£,  =  C„  +  C| /I -f  ne^  + /?c^,  (14) 

where  C„  ■+  C,  n  includes  the  composition  independent  and 
£h,  (nearly  linear)  terms,  and  a  =  6.  £=  0,  for  n  =  0.1; 
a  =  4,  /?  =  2.  for  n  =  2;  a  =  2,  =  4.  for  «  =  3;  and  a  =  0. 


n-2n._,Cd,S  n 


n:Zn,,.,Cd,S  n  Cd,)— Hg.S  n  Hg,j.,2n,S 


FIG.  5.  Extraclion  energies  in  the  S  common  anion  alloys  as  a  funclion  of 
near-neighbor  configuration  for  bulk  conceniralion  of.r  =  0.5. 
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FIG.  6.  Ejitraclion  energies  in  Ihe  Se  common  anion  alloys  as  a  function  of  FIG  8  Extraction  energies  in  the  Cd  common  cation  alloys  as  a  function  of 
near-neighbor  configuration  for  bulk  concentration  of  .t  =  0.5  near-neighbor  configuration  for  bulk  concentration  of  x  =  0-5 


P  =  tjoxn  —  4.  Note  that  between  n  =  0  and  1 ,  the  slope  of 
£,  versus  n  is  determined  by  the  variation  of  with  n.  For 
n  =  1-4,  the  slope  is  a  sum  of  the  E^,,  contribution  plus  a 
contribution  from  the  electrons  in  the  dangling  hybrids. 
Consider  the  Zn  vacancy  in  ZnS4  _  „  Se„  -  The  C„  -f  C, «  con¬ 
tribution  to  E^  will  decrease  with  increasing  n  because 
E  >  E  This  slope  can  be  seen  between  n  =  0  and  1  in 
Fig.  7.  The  electrons  in  the  dangling  hybrids  will  produce  an 
increasing  contribution  to  E  ^  with  increasing  n  because 
ff,  <ef'-  The  sum  of  the  C,,  -f  C,  «„  term  (positive  slope) 
plus  the  energy  from  the  electrons  in  the  dangling  hybrids 
(negative  slope  between  n  =  1  and  4) yields  a  net  negative 
slope  for  ;?  =  1-4.  A  similar  analysis  follows  for  the  cation 
vacancy  in  other  II-VI  common  cation  alloys.  Once  again, 
the  general  shape  and  sign  of  the  slopes  of  the  E^  curves  can 
thus  be  predicted,  based  on  the  bond-orbital  analysis,  al¬ 
though  the  more  detailed  cluster  calculation  is  needed  for  a 
quantitative  description. 

The  average  anion  vacancy  in  these  alloys  varies  nearly 
linearly  with  n  as  expected,  because  the  alloy  dependence 
enters  in  the  second-neighbor  sites.  A  measure  of  the  extrac¬ 
tion  energies  sensitivity  to  the  detailed  arrangement  in  the 
second-neighbor  sites  can  be  gathered  from  the  rms  devia¬ 
tions  indicated  in  the  dashed  lines  in  Figs.  7-9. 


n:ZnS._.Se,  n:ZnSe4-,Te„  n:ZnTe.,-,S, 


FIG.  7.  Extraction  energies  in  the  Zn  common  cation  alloys  as  a  function  of 
near-neighbor  configuration  for  bulk  concentration  of  .r  =  0.5 
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D.  Ill-V  pseudobinary  alloys 
1.  Common  anion  alloys 

For  anion  extraction  energies  in  the  Ill-V  common  an¬ 
ion  alloys  shown  in  Figs.  10-12,  the  analysis  is  similar  to  that 
in  the  II-Vl  alloys.  As  above,  we  use  atomic  term  values  and 
the  bond-orbital  model  to  interpret  our  results.  We  first  note 
that  <e^'<  fl,"  and  E';'^  <  £*/'  <  £^-'^  for  X  =  P,  As. 
and  Sb.  The  analysis  is  as  follows,  using  the  P  vacancy  in 
Ga^  „In„£  (center  panel  of  Fig.  10)  as  an  example.  Be¬ 
cause  £  <  £5,’,'*’,  this  energy  produces  a  linear  contribu¬ 

tion  to  E ,  ,  with  negative  slope,  as  shown.  The  three  elec¬ 
trons  at  the  vacancy  site  will  preferentially  occupy  the 
lower-energy  Ga  hybrid  states.  Thus  £,  can  be  written  as 

£,  =  C„  -(-  C,  n  -F  ae';;-'  +  He]'',  (15) 

where  a  =  3,  £  =  0,  for  «  =  0-2;  a  =  1,  p  =  1,  for  n  =  3; 
and  a  =  0,  £  =  3,  for  n  =  4.  Thus,  for  n  =  0-2,  the  slope  of 

E ,  vs  n  will  be  negative,  dominated  by  the  back-branch 
energy  dependence  on  n.  For  n  =  3  and  4,  the  three  electrons 
occupying  the  dangling  hybrid  levels  contribute  an  energy  to 
£,  which  increases  with  increasing  n. 

A  similar  analysis  follows  for  the  other  III-V  common 
anion  alloys.  With  one  exception,  the  behavior  of  £,  vs  n 
follows  the  simple  bond-orbital  analysis  presented  above. 


n;HgS»_,Se„  n:HgSe»_Te,  n:HgTev..S, 


FIG.  9.  Extraction  energies  in  the  Hg  common  cation  alloys  as  a  function  of 
near-neighbor  configuration  for  bulk  concentration  ofx  =  0.5. 
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n  Al4.„Go.P  n  G04_Jn.P  n  in4.4AtP 


n  Ai*_,Go«SD  n  G04-«‘ri^Sb  n  in*_.A;,So 


FIG.  10.  Exiraciion  energies  in  ihe  P  common  anion  alloys  as  a  function  of  FJG  12  Extraction  energies  in  l he  Sb  common  anion  alloys  as  a  function  of 

near-neighbor  configuration  for  bulk  concentration  ofv*  =  0.5.  near-neighbor  configuration  for  bulk  concentration  of  x  =  0.5 


The  exception  is  the  In4  _  „  A1„X  alloys,  for  X  =  P,  As,  and 
Sb.  Here  the  atomic  term  values  suggest  that  the  A1  hybrid 
levels  are  of  lower  energy  than  the  In  hybrid  levels,  <  e),", 
while  the  cluster  calculations  find  the  opposite,  i.e.,  e^'  >  ^J,". 
The  reasons  for  this  reversal  are  related  to  the  coupling  of  the 
dangling  hybrids  at  the  vacancy  site  to  bonds  in  the  rest  of 
the  cluster,  which  has  been  included  in  the  cluster  diagonali- 
zation;  this  can  result  in  a  significant  energy  shift  of  the  dan¬ 
gling  hybrid  levels.  We  have  found  that  for  most  systems, 
this  shift  does  not  alter  the  relative  order  of  the  hybrid  energy 
levels. 

An  analysis  of  the  cation  extraction  energies  follows 
that  presented  for  the  II-VI  Te  alloy  above.  As  was  found  for 
the  II-VI  common  anion  alloys,  the  cation  extraction  ener¬ 
gies  are  not  as  sensitive  to  the  environment  as  are  the  anion 
extraction  energies,  because  the  effects  of  alloying  come  in 
the  second-neighbor  sites.  The  rms  deviations  indicated  as 
dashed  lines  in  Figs.  10-12  provide  an  indication  of  this  sen¬ 
sitivity. 

2.  Common  cation  alloys 

For  cation  extraction  energy  in  the  III-V  common  ca¬ 
tion  alloys  shown  in  Figs.  13-15,  the  analysis  is  similar  to 
that  in  the  II-VI  alloys.  Using  the  atomic  term  values  and 
bond-orbital  model  to  interpret  the  results,  we  first  note  that 


ft"  and  E^^<E  <  E  for  X  =  Al,  Ga.  and 

In.  The  analysis  is  as  follows,  using  the  Ga  vacancy  in 
GaAs4 ,  „Sb„,  as  shown  in  the  center  panel  of  Fig.  14.  Be¬ 
cause  this  energy  produces  a  linear  contribu¬ 

tion  to  £,  with  a  negative  slope  versus  n.  The  five  electrons 
at  the  vacancy  site  will  preferentially  occupy  the  lower-ener¬ 
gy  As  hybrid  states.  Thus  £,  can  be  written  as 

£,  =  C„ -I- C,  M -I- ac*' -I- (16) 

where  a  =  5.  =  0,  for  /i  =  0, 1 ;  a  =  4.  ^  =  1 ,  for  «  =  2; 

a  =  2.  £  =  3,  for  n  —  3;  and  a  =  0,  £  =  5,  for  n  —  4.  Thus, 
for  «  =  0-1,  the  slope  of  £^  vsw  will  be  negative,  dominated 
by  the  dependence  of  £h,  on  /;.  For  n  =  2-4,  E ^  will  have  a 
decreasing  (with  n)  contribution  from  C,  n  and  an  increas¬ 
ing  contribution  from  the  electrons  in  the  dangling  hybrids. 
The  increase  in  energy  occurs  because  the  five  electrons  at 
the  vacancy  site  occupy  the  higher-energy  Sb  hybrid  levels. 
Thus  the  slope  will  be  nearly  linear  between  n  =  2  and  4, 
because  of  the  linear  increase  from  the  electrons  in  the  dan¬ 
gling  hybrid  levels.  Again,  a  similar  analysis  follows  for  the 
other  III-V  common  cation  alloys.  All  of  the  IIl-V  common 
cation  alloys  follow  the  qualitative  behavior  predicted  by  the 
atomic  term  values  e,,  and  the  bond-orbital-model  results. 
And,  as  expected,  the  average  anion  extraction  energies  are 
found  to  vary  nearly  linearly  with  n. 


n:AU.„Ga„As  n:Ga4_Jn,>Vs  n:ln».,AI.As 


n:AIP4.,As,  n.AIAs,.„Sb,  n  AlSb*_,P, 


FIG.  11.  Extraction  energies  in  the  As  common  anion  alloys  as  a  function  of  FIG.  13.  Extraction  energies  in  the  Al  common  cation  alloys  as  a  function  of 

near-neighbor  configuration  for  bulk  concentration  of  x  =  0.5.  near-neighbor  configuration  for  bulk  concentration  of  x  =  0.5. 
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n  Cop4-*A8«  n  GaAs«^Sb»  n;CaSb4-«P« 


4 


j  :  .1  ,  .  ! 

h  i  ■  4  ^  '  A  ■  '  b'^  i  ■  *  ‘  4  “  i  ‘  J  4  ■  i  ‘  4  ‘  t  'o  *  >  2 

Gg^,;_,As„  GaAs,2-«SD,  n  GaSbt2-'»P« 

FIG.  14.  E.xiraciion  energies  in  the  Ga  common  canon  alloys  as  a  function 
of  near-neighbor  conhguraiion  for  bulk  concentration  of  .r  =  0  5 


E.  Consequences  of  alloy  variation  of  f  ^ 

The  predicted  nonlinear  variation  of  common-species 
extraction  energies  in  alloys  will  have  an  important  impact 
on  the  calculation  of  vacancy  densities  in  these  materials.  In 
particular,  the  presence  of  a  minimum  in  implies  larger 
vacancy  concentrations  than  a  simple  linear  variation  would 
predict.  To  demonstrate  this,  we  choose  for  an  example  the 
Te  vacancy  in  the  Te  common  anion  alloys.  As  discussed  in 
Sec.  Ill  B  above,  the  definition  of  the  Schottky  defect  forma¬ 
tion  energy  of  the  individual  constituents  in  the  compounds 
and  alloys  is  not  rigorously  defined,  and  as  a  consequence, 
the  calculation  of  the  vacancy  concentration  in  these  materi¬ 
als  requires  additional  information  about  the  system,  e.g., 
stoichiometry  and  external  phases.  To  demonstrate  the  ef¬ 
fect  that  the  nonlinear  variation  in  £.,  can  have  on  proper¬ 
ties  such  as  the  vacancy  concentration,  we  calculate  a  some¬ 
what  fictitious  quantity  n,  as  defined  below.  This  quantity 
has  no  direct  significance,  but  does  demonstrate  the  impact 
that  a  nonlinear  variation  in  can  have  on  measurable 
properties.  We  begin  by  defining  n^"(.t,n),  which  is  related 
to  the  Te  vacancy  density  for  a  given  class  of  cluster  n,  in  the 
alloy  of  composition  x,  as  follows. 

=  T](x,n).\\,P„  exp[  -  E ,  (x,n)/kT],  (17) 
where  k  is  Boltzmann  s  constant,  T  is  the  temperature  in 


n:lnp4.^S„  n  InAs^.^Sb,  n;lnSb4_„P» 


FIG  1 5  Extraction  energies  in  the  In  common  cation  alloysasa  Tunctionor 
near-ncighbor  configuration  for  bulk  concentration  of  .t  =  0.5. 


Kelvin,  No  is  the  anion  site  density,  is  the  random  proba¬ 
bility  factor,  given  by  Eq.  (6),  and  Jjfx.n)  is  as  defined  in  Eq. 
(13).  Note  that  we  have  used  the  calculated  zero-tempera¬ 
ture  value  of  £,  ;  more  properly,  the  finite-temperature  val¬ 
ue,  including  the  effect  of  the  Fermi  level,  should  be  used 
here.  Summing  over  all  classes  of  clusters,  we  find 

n^-ix)  =  (18) 

and  normalizing  to  for  pure  CdTe,  we  find 


n’^lCdTe) 

=  (^)  e.xp{[£:^(CdTe) 

-  El^(x)]/kT}.  (19) 

where  we  have  assumed  all  77's  are  equal,  's  the  lattice 
constant  of  (ZdTe  and  a ,  is  the  volume-averaged  lattice  con¬ 
stant  for  the  alloy  of  composition  x.  Values  of  nj^ix)  are 
shown  in  Fig.  16  for  £=  800  K.  Shown  for  comparison  are 
the  calculated  values  based  on  extraction  energies  and  as¬ 
suming  a  simple  linear  interpolation  between  then  =  0and4 
clusters  of  Fig.  2.  The  difference  between  the  linear-interpo- 
lated  values  and  the  calculated  values  is  quite  dramatic,  espe¬ 
cially  in  theCdHgTe  alloy  where  n^  (x)  differ  by  as  much  as 
five  orders  cf  magnitude;  the  difference  will  be  smaller  in 
calculations  for  which  the  final  states  on  the  surface  are  con¬ 
sidered.  and  thus  an  energy  related  to  £,„„  is  subtracted  from 
£,  .  Note  that  because  of  the  minimum  in  £,  ,  the  vacancy 
concemralion  for  some  composition  in  the  alloy  is  found  to 
be  greater  than  in  the  constituent  compounds.  For  the  par¬ 
ticular  case  of  HgCdTe  (center  panel  of  Fig.  16),  the  differ¬ 
ence  between  fi  for  the  linear  and  nonlinear  £,  is  greatest  for 
the  Cd-rich  alloys.  We  thus  predict  that  even  a  small  addi¬ 
tion  of  HgTe  to  CdTe  may  result  in  a  significant  enhance¬ 
ment  in  the  vacancy  density  in  the  resulting  alloy,  compared 
to  that  in  pure  CdTe  Similar  results  are  expected  for  other 
alloys. 

Next  we  consider  the  cation  vacancies  in  the  Te  com¬ 
mon  anion  alloys.  Analogous  to  Eq.  (18)  above,  we  calcu¬ 
late 


FIG.  16.  Te  common  anion  alloys  using  the  calculated  Te  extraction  ener¬ 
gies  from  Fig.  2  (solid  line)  and  linearly  interpolated  extraction  energy 
based  on  the  X  =  0  and  I  values  from  Fig.  1  (dashed  line) 
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12  ^ 

«:•" . (x.«)  =  A’„^  p„  y  v(xM,g,.)P^ 

"  II  nT"  I 

xexp[  —  (A,«,g„  )/^^  ]  (20) 

where  P„  and  P^  are  as  defined  in  Sec.  Ill  As  above,  we 
normalize  to  n;  '"""  in  CdTe.  Values  of  h)"'""'  are  shown  in 
Fig.  17for  r=  BOOK.  Because  the  nonlinearitv  for  an  atom 
of  a  substituted  species  was  found  to  be  less  than  for  the 
common  atom  species,  we  find  a  less  dramatic,  although  still 
substantial,  difference  between  the  linear  and  nonlinear  re¬ 
sults. 

From  the  above  discussion,  we  see  that  the  effects  of  a 
nonlinear  variation  of  (n)  will  be  most  important  for  the 
common  species.  One  consequence  of  £^  (n)  reaching  a 
minimum  for  finite  alloy  concentration  may  be  a  noticeable 
enhancement  in  the  diffusion  of  the  common  species  in  the 
alloy,  for  atoms  that  diffuse  by  a  simple  vacancy  mechanism 
across  the  complete  composition  range,  and  for  materials  in 
which  the  quality  of  the  alloys  is  comparable  to  that  of  the 
constituent  compounds,  so  that,  for  example,  enhanced  dif¬ 
fusion-down  dislocations  are  not  a  factor.  For  example,  Te  is 
believed  to  diffuse  by  an  interstitial  mechanism'"  in 
Hg,  _  ,Cd,  Te,  at  least  for  low  a;  therefore,  we  would  not 
expect  to  observe  an  enhanced  diffusion  based  on  a  simple 
vacancy  mechanism  in  this  system.  A  second  possible  conse¬ 
quence  of  the  nonlinear  variation  in  £  and  subsequent  effect 
on  vacancy  concentrations  may  be  enhanced  diffusion  via 
percolation.  It  has  been  suggested"*'  that  percolation  effects 
might  play  a  role  in  diffusion  in  HgCdTe.  Because  vacancy 
densities  will  be  greater  on  particular  classes  of  sites,  deter¬ 
mined  by  the  variations  shown  in  Fig.  2,  migration  from  one 
low-energy  site  to  another  will  be  energetically  favored  over 
migration  to  a  higher-energy  site,  assuming  migration  ener¬ 
gies  do  not  vary  by  as  great  an  amount  as  E.,  .  If  there  are 
enough  low-energy  sites  to  form  percolation  paths,  then  two 
parallel  diffusion  mechanisms  may  be  present.  This  kind  of 
multiple  diffusion  coefficient  profile  is  observed  for  Hg  in 
Hg,  _  jCd^Te.'"  '"  It  remains  to  be  proved  that  percolation, 
rather  than  diffusion-down  dislocation  cores  or  some  other 
extrinsic  mechanism,  is  the  explanation  for  the  Hg  diffusion 
results.  If  percolation  is  the  explanation,  then  based  on  our 


FIG.  17.  Normalized  canon  vacancy  concent ralions  in  the  Te  common  an¬ 
ion  alloys  using  the  calculated  cation  extraction  energies  from  Fig.  2  (solid 
line)  and  linearly  interpolated  extraction  energy  based  on  the  x  =  0  and  1 
values  from  Fig  1  (dashed  line). 


extraction  energies,  percolation  should  be  a  feature  of  self¬ 
diffusion  in  many  alloys. 

If  deep  levels  exist  because  of  the  presence  of  vacancies, 
the  populations  of  various  classes  of  vacancies  in  the  alloys 
may  be  measured  by  various  techniques,  such  as  deep-level 
transient  spectroscopy  or  phoioluminescence,'*' Results 
on  the  relative  population  of  each  class  of  clusters  will  de¬ 
pend  on  the  correlation  state  of  the  alloy,  its  randomness, 
and  the  vacancy  formation  energy  for  the  given  class  of  clus¬ 
ter.  A  deviation  from  a  simple  random  population  for  the 
various  cluster  types  could  indicate  either  a  correlated  alloy, 
driven  by  chemical  or  strain  terms  in  the  bulk,  or  a  variation 
in  the  formation  energy  in  the  various  classes  of  clusters 

V.  SUMMARY  AND  CONCLUSION 

We  have  found  extraction  energies  calculated  using  a 
tight-binding  cluster  Hamiltonian  give  good  agreement  w  ith 
the  tight-binding  Green's  function  results,  in  particular  for 
the  cation  vacancies.  Our  results  indicate  that  the  extraction 
and  Schottky  defect  formation  energies  increase  with  in¬ 
creasing  cohesive  energy.  In  a  given  compound,  the  calculat¬ 
ed  extraction  energies  are  found  to  be  larger  for  the  anion 
than  the  cation,  in  agreement  with  the  experimental  results, 
with  the  difference  between  the  cation  and  the  anion  being 
larger  in  the  Il-Vl  than  in  the  III-V  compounds.  Compari¬ 
son  of  our  calculated  values  with  experiments  and  previous 
theory  show  good  agreement  for  Si  and  CdTe.  although  our 
calculated  values  are  in  general  larger  than  experiments  for 
those  compounds  for  which  data  are  available.  This  may  be 
due  to  differences  in  the  occupation  of  localized  levels,  w  hich 
may  be  poorly  modeled  in  our  cluster  calculation,  particu¬ 
larly  for  the  anion  vacancy  which  in  general  creates  donor 
levels  in  the  gap.  For  the  extraction  energy  of  an  atom  of  the 
substituted  species  in  the  alloys,  we  find  a  nearly  linear  de¬ 
pendence  on  the  near-alloy  environment,  as  expected,  be¬ 
cause  the  substitution  is  in  second-neighbor  sites.  We  have 
found  a  large  variation  in  the  extraction  energy  with  the 
near-neighbor  environment  for  the  removal  of  an  atom  of  the 
common  species  for  which  the  alloy  substitution  is  in  the 
first-neighbor  shell.  In  some  alloys  we  have  found  the  extrac¬ 
tion  energy  to  reach  a  minimum  away  from  the  compound 
endpoints,  indicating  the  possibility  for  enhanced  vacancy 
densities,  and  consequently  enhanced  diffusion,  in  the  al¬ 
loys. 
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APPENDIX  A:  COULOMB  ENERGY 

In  the  zinc-blende  semiconductor  compounds,  the 
cations  and  anions  in  the  lattice  carry  a  net  charge  of  magni¬ 
tude  ]Z  1,  but  of  opposite  sign  at  the  two  lattice  sites.  When  a 
neutral  cation  or  anion  is  removed  from  the  zinc-blende  lat¬ 
tice.  the  net  charge  that  previously  resided  at  that  site  is 
redistributed  in  the  lattice.  This  results  in  a  modification  to 
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me  couiomo  energies  m  me  crystal,  both  to  the  Madelung- 
enhanced  K  and  the  average  local  electron-electron  interac¬ 
tion  energy  U.  Defining  Z,  as  the  charge  initially  residing  on 
an  atom  in  the  /th  shell  about  the  vacancy  site,  with  Z„  the 
charge  on  the  central  atom,  Z,  the  charge  on  a  first-neighbor 
atom,  etc.,  and  <5,  as  the  modification  to  the  charge  on  an  /th- 
neighbor  atom  when  the  vacancy  is  created,  the  contribution 
to  the  vacancy  formation  energy  due  to  the  electron-electron 
energy  AC^and  the  Madelung  energy  AA"  are  given  by 

+6,)-  -  (Z,)'],  (Al) 

I  in  2 

nz, +s,)(Z, +6^)  (Z)(Z,) 

V  '•.y  r, 

Here  /  and  j  sum  over  all  shells  of  atoms,  m,  and  sum  over 
the  atoms  within  the  /th  and yth  shell,  respectively,  and  is 
the  distance  between  the  m,  th  and  th  atoms.  The  prime  on 
the  sum  in  Eq.  ( A2 )  indicates  that  /  =  j  is  not  to  be  included 
in  the  sum.  Values  for  the  intra-atomic  interaction  param¬ 
eter  U,  differ  at  the  cation  and  anion  site  and  were  taken 
from  Ref  47. 

From  the  diagonalization  of  the  cluster  Hamiltonian 
containing  a  vacancy,  we  find,  for  most  systems,  that  the 
charge  redistribution  is  primarily  to  the  first-neighbor  shell 
about  the  vacancy.  Assuming  all  charge  shift  is  to  the  first- 
neighbor  sites,  the  above  expressions  reduce  to 

At/'=  -  ('6'„  +  Jf/,  )Zr,e-  (A3) 

AA  =  — Z,^e=.  {A4) 

d 

In  calculating  A/f  we  have  included  an  effective  reduction  in 
the  Madelung  constant  to  account  for  the  effects  of  the  finite 
spacial  extent  of  the  atomic  charge. 

APPENDIX  B:  LATTICE  RELAXATION 

When  a  vacancy  is  created,  the  atoms  about  the  vacancy 
site  will  in  general  relax  away  from  their  perfect  crystalline 
position.  Here  we  consider  only  radial  displacements  with 
respect  to  the  cluster  center,  Jahn-Teller  distortions,  which 
lower  energies  by  breaking  degeneracy,  are  not  addressed. 
For  the  unreiaxed  lattice  we  define  R„  as  the  distance  from 
the  cluster  center  to  the  nth  atom  shell.  The  relaxation  in  the 
cluster  containing  a  vacancy  is  then  characterized  by  <5„, 
defined  through  the  relationship 

r„  =/?„(! -<5„),  (Bl) 

with  /•„  defined  as  the  distance  from  the  center  to  the  nth- 
neighbor  atom  in  the  relaxed  cluster. 

The  r„  are  computed  by  minimizing  with  respect  to 
bond-length  changes.  In  the  energy  minimization,  the  elastic 
energy  of  the  lattice  beyond  the  cluster  is  included  as  an 
elastic  continuum.  Covalent  energy  changes  from  both 
bond-length  and  bond-angle  distortions  within  the  cluster 
have  been  mcludeO,  and  rigid  hybrids  nave  been  assumed. 
Here  (d„),  (JF,,  (d,,  )/dd,  and  d-V„  (d,,  )/dd'  for  the  ini¬ 


tial-state  cluster  were  fit  to  the  experimental  cohesive  ener¬ 
gy.  the  equilibrium  bond  length  dg,  and  the  experimenul 
bulk  modulus,  respectively.  For  small  bond-length  vana- 
tions  away  from  equilibrium,  F„  (d)  can  be  expressed  by  the 
Taylor’s  series  expansion 

Fo  (d)  =  F.,  (d„  )  -h  (d  -  d„  )  F'  (do  ) 

^Ud-do)'Fo(d„).  (B2) 

This  form  of  Fo  is  used  to  compute  the  overlap  energy  of 
final-state  bonds  of  length  d.  The  strain  energy  of  the  elastic 
continuum  is  calculated  from 

Etr  =  C  (B3) 

where  1  corresponds  to  the  outermost  shell  of  the  cluster  and 
C  is  an  elTective  shear  constant’'  given  by 

C=rr[1.6(C,,  -C,, ) -r4.8C«].  (B4) 

Experimental  values  for  C, ,  —  C,,  and  C44  have  been  used 

APPENDIX  C:  DANGLING  HYBRID  INTERACTION 

One  additional  correction  to  the  vacancy  formation  en¬ 
ergies  should  be  added,  because  of  the  Jahn-Teller  distortion 
at  the  vacancy  site  and  the  resulting  splitting  of  the  degener¬ 
acy  and  lowering  of  the  total .  energy.  This  interaction 
between  the  dangling  hybrids  at  the  vacancy  site  was  not 
included  in  the  cluster  Hamiltonian  because  only  first-neigh¬ 
bor  interactions  have  been  included  to  this  point.  For  large 
enough  cluster  sizes  ( three  or  more  shells  of  neighbors  about 
the  vacancy  site),  the  dangling  hybrid  states  are  narrowly 
split  into  a  singly  degenerate  A ,  state  and  a  triply  degenerate 
r,  state,  from  the  coupling  of  the  dangling  hybrid  levels 
through  unbroken  cluster  bonds.  The  direct  dangling  hybrid 
interaction  results  in  an  additional  splitting  of  the  levels  at 
the  Tj  symmetry  vacancy  site,  with  the  .4,  level  lowered  by 
an  amount  3o  and  the  T,  levels  raised  by  an  amount  a, 
where  a  is  the  magnitude  of  the  coupling  constant.  From 
self-consistent  Green’s  function  calculations,  Baraff,  Kane, 
and  Schliiter^*  found  a  total  of  A,  —  T.  splitting  of  1.7  eV 
for  the  vacancy  in  silicon.  Assuming  half  of  this  splitting 
results  from  the  Coulomb  interaction,  we  estimate 
a^,  =  0.21  eV.  To  estimate  the  contribution  of  the  splitting 
of  the  dangling  hybrid  energies  on  the  vacancy  formation 
energy  in  other  compounds,  the  value  of  a  for  Si  is  scaled 
according  to 


as  suggested  by  extended  Hiickel  theory.  Here  e,,  is  the  hy¬ 
brid  energy  for  the  cation  or  anion  adjacent  to  the  vacancy 
site.  Although  not  rigorous,  this  simple  scaling  rule  should 
yield  the  proper  trends  among  the  compounds. 
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Defects  in  ZnTe,  CdTe,  and  HgTe:  Total  energy  calculations 
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Total  energies  for  various  impurities  and  defects  in  HgTe,  CdTe,  and  ZnTe  are  calculated. 
Calculations  were  done  using  a  self-consistent  linear  muffin-tin  orbital  (LMTO)  method  within 
the  local  density  and  atomic  spheres  approximation.  We  calculate  the  total  energy  for  substitution 
on  both  lattice  and  interstitial  sites,  and  estimate  the  lattice  strain  energies.  Estimates  of  the 
variation  with  the  alloy  predict  a  linear  variation  of  the  substitution  energy  with  the  local 
concentration.  We  predict  that  the  Te  antisite  will  be  more  prevalent  in  all  three  of  the  compounds 
than  previously  thought.  The  problem  of  cross  doping  during  heteroepitaxy  on  GaAs  is  predicted 
to  be  greater  on  the  cation  sublattice. 


I.  INTRODUCTION 

Controlled  doping  of  HgCdTe  and  HgZnTe  during  bulk  and 
epitaxial  growth  is  an  important  aspect  of  device  fabrication 
from  these  materials.  Understanding  the  thermodynamics  of 
the  various  native  point  defects  and  impurities  substitution 
energies  is  a  key  step  towards  controlling  the  doping  in  these 
narrow-gap  alloys.  Equilibrium  between  the  native  defects 
and  impurities,  subject  to  a  specified  stoichiometry  and  ex¬ 
ternal  phases,  must  be  understood  in  order  to  deduce  ways  to 
improve  control  of  doping  in  these  materials.  Additionally, 
the  kinetics  of  defect  and  impurity  diffsion  can  be  important. 
Calculation  of  thermodynamic  properties  of  the  various  na¬ 
tive  and  impurity  point  defects  require  free  energies  (enthal¬ 
pies  and  entropies)  of  formation.  The  energies  reported  here 
are  the  appropriate  enthalpies.  These  energies  also  provide  a 
reference  point  for  the  discussion  of  more  complicated  phe¬ 
nomena,  such  as  dopant  incorporation  during  nonequilibri¬ 
um  epitaxial  growth  or  ion  implantation. 

Accurate  total  energies  of  defects  and  impurities  in  semi¬ 
conductors  are  difficult  to  calculate.  Even  more  challenging 
is  the  theo!  y  of  the  energy  positions  of  the  deep  levels  result¬ 
ing  from  such  defects.  Although  tight-binding  methods  have 
been  successful  in  studying  the  trends  of  deep  levels  as  a 
function  of,  for  example,  host  lattice  or  alloy  environment, 
experimental  guidelines  are  often  used  to  specify  the  impuri¬ 
ty  potential  itself '  Because  the  local  density  approximation 
(LDA)  yields  an  incorrect  band  gap,  identifying  the  energy 
levels  of  localized  states  in  the  band  gap  in  first-principle 
calculations  using  the  LDA  is  still  an  outstanding  problem. 
Additionally,  in  the  present  work,  the  use  of  supercells  leads 
to  a  large  dispersion  in  the  localized  states,  adding  to  diffi¬ 
culty  in  directly  calculating  the  deep  state  energies,  although 
attempts  have  been  made  to  do  so.’  In  the  present  work,  we 
do  not  focus  on  the  exact  positioning  of  the  localized  levels  in 
discussing  our  results,  although  general  trends  in  the  deep 
levels  are  discussed  in  some  cases. 

The  remainder  of  this  paper  is  organized  as  follows.  In 
Sec.  II,  the  theoretical  approach  is  summarized  and  refer¬ 
ences  to  the  appropriate  literature  are  made.  In  Sec.  Ill,  we 
present  results  for  an  array  of  native  defects  and  dopants  in 
HgTe,  CdTe,  and  ZnTe.  The  work  is  summarized  and  con¬ 
clusions  are  drawn  in  Sec.  IV. 


II.  THEORY 

The  self-consistent  linear  muffin-tin  orbital  (LMTO) 
method,  originally  developed  by  Andersen,’  is  used  to  calcu¬ 
late  the  substitution  energies  in  HgTe,  CdTe,  and  ZnTe.  As 
iscustomary,  the  LDA  to  the  exchange-correlation  energy  is 
used  to  construct  an  effective  one-electron  potential.  Within 
the  atomic  spheres  approximation,  LMTO  is  computation¬ 
ally  efficient  and,  unlike  the  pseudopotential  method,  is  suit¬ 
able  for  solids  composed  from  any  atom  in  the  periodic  table. 
We  use  a  minimal  (spd)  basis  set  on  atom-  and  space-cen¬ 
tered  atomic  spheres.  The  resulting  18  functions  per  atom 
are  further  reduced  by  orbital  downfolding’  of  all  if -orbitals 
except  those  on  the  cation  atomic  spheres.  Without  such  an 
optimal  basis  set,  the  large  data  base  of  calculations  present¬ 
ed  here  would  not  have  been  possible. 

Substitution  energies  are  calculated  from  a  difference  in 
total  energies  of  the  compound  with  and  without  the  defect 
or  impunty .  Large  unit  cells  of  32  spheres  with  one  impurity 
per  cell  are  repeated  periodically  to  form  a  superlattice  of 
impurities.  The  sensitivity  of  the  substitution  energies  to  the 
size  of  the  unit  cell  has  been  examined  and  errors  of  the  order 
of  0.2  eV  are  estimated.  The  periodic  array  of  impurities 
results  in  the  dispersion  of  the  impurity  states  into  bands  in  k 
space,  contributing  to  the  difficulty  in  identifying  the  deep 
state  energies.  As  a  check  of  the  present  supercell  approach, 
we  have  repeated  the  LMTO  Green’s  function  calculations 
done  by  Beeler  et  al.^  using  supercells,  and  we  find  similar 
conclusions  for  the  chalcogen  impurities  in  silicon. 

Because  the  lattice  strain  about  some  impurities  and  de¬ 
fects  can  be  substantial,  calculation  of  the  relaxation  energy 
is  necessary  to  yield  an  accurate  estimate  of  the  total  ener¬ 
gies.  In  the  present  calculation,  Methfessel’s  form  of  the 
force  theorem'’  is  used  to  calculate  the  pressure  on  each  su¬ 
percell  containing  a  single  defect  or  impurity.  An  upper  esti¬ 
mate  of  the  lattice-relaxation  energy  is  calculated  assuming 
all  bonds  in  the  supercell  respond  uniformly  to  this  pressure, 
characterized  by  the  bulk  modulus  of  the  host  lattice. 

The  atomic  spheres  approximation  (ASA),  as  used  in  the 
present  work,  also  substitutes  for  the  true  density  functional 
one  that  spheridizes  the  output  density.  In  consequence  of 
this,  calculation  for  defects  with  high  deviations  from  cubic 
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symmetry,  (e  g.,  defects  that  cause  large  strain)  must  be 
viewed  with  caution. 

III.  RESULTS 

For  the  pure  compounds  without  any  defects,  the  LMTO- 
ASA  was  found  to  give  cohesive  energies,  bond  lengths,  and 
bulk  moduli  in  good  agreement  with  experiment.  Calculated 
cohesive  energies  per  bond  are  —  1.28,  —  1.19,  and  —0.91 
eV  for  ZnTe,  CdTe,  and  HgTe,  respectively,  compared  to  the 
experimental  values  of  —  1.20,  —  1.10,  and  —  0.82  eV.  To¬ 
tal  energies  for  several  group  IB,  III,  and  V  dopants  in  the 
pure  compounds  HgTe,  CdTe,  and  ZnTe  are  summarized  in 
Table  I.  Also  shown  are  the  total  energies  for  the  antisite  and 
self-interstitials.  Two  nonequivalent  tetrahedral  interstitial 
sites  were  considered,  one  with  four  anion  nearest  neighbors 
/  (Te),  and  the  other  with  four  cation  nearest  neighbors 
/(Zn,  Cd,  or  Hg). 

The  total  energies  given  in  Table  I  are  referenced  to  initial 
and  final  states  where  the  excess  (or  exchanged)  atoms  are 

Table  I.  Total  energies  of  defect  formation.  Notation  isA  (B),whereAis 
the  defect  and  B  is  the  defect  site.  When  atoms  are  added  or  removed  in  the 
formation  of  a  defect,  the  free  atom  initial  and  final  states  are  used  as  a 
reference.  Sign  in  parentheses  following  the  energies  have  the  following 
meaning.  (  +  )  relaxation  is  outward  and  relaxation  energy  is  greater 

than  0.2S  eV;  (  +  )  relaxation  is  outward  and  relaxation  energy  is  less  than 
0.25  eV;  (.)  relaxation  is  negligible;  (  —  )  relaxation  is  inward  and  relaxa¬ 
tion  energy  is  less  than  0.25  eV. 


Total  energy  (eV) 


Defect 

ZnTe 

CdTe 

HgTe 

Group  IB  impurities 

Cu  (cation) 

-  I.4I(  H-) 

-0.98(  -  ) 

-2.2I(- 

) 

Ag  (cation) 

-0.28(  +  ) 

-  0.62(  -  ) 

-  1.92(  - 

) 

Au  (cation) 

-0.53(.) 

-  I.02(  -  ) 

t 

04 

04 

1 

) 

Group  III  impurities 

AI  (cation) 

-  I.04(  ) 

-  I.57(.) 

-  3.02(  - 

) 

Ga  (cation) 

-0.25(-^) 

-0.77(.) 

-2.02(.) 

In  (cation) 

+  0.80(  ) 

-0.44(  +  ) 

-  I.93(  + 

) 

Group  V  impurities 

P  (Te) 

-h  I.26(-) 

-t-  I.39(-) 

-1-  I.07(  - 

) 

As (Te) 

-1-  I.IO(-) 

-l-I.23(-) 

-i-  0.80(  - 

) 

Sb  (Te) 

-i-0.95(-) 

-(-  I.08(.) 

-(-  0.48  (  - 

) 

P  (cation) 

-(-I.45(  +  ) 

-t-0.74(.) 

-0.56(.) 

Sb  (cation) 

-1-  3.07{  +  +  ) 

-1-  I.5I(-) 

-  0.36(  + 

) 

Impurity-interstitials 

P  (/  (cation)] 

-i-0.0I(  +  ) 

-0.17(  +  ) 

-0.4I(  + 

) 

P  l/(Te)l 

+  0.54(  +  +  ) 

-1-  0.05(  -1- ) 

-  0.90(  -1- 

) 

In  [/  (cation)  1 

-0.83(  + 

■•-) 

In  [/(Te)] 

-0.82(  + 

-v) 

Self-interstitials 

Cation  [/(cation)]  -)-0.43(  +  -i- ) 

+  0.87{  -1.  +  ) 

-)-0.98(  + 

+  ) 

Cation  [/(Te)] 

-l-0.78(  +  -f ) 

+  I.03(  -1-  -I- ) 

+  0.84(  + 

+  ) 

Te  [/  (cation)] 

-l-2.6I(  -1-) 

+  I.45(-i-  +) 

-1-  l.33(  + 

+  ) 

Te[/(Te)] 

-l-4.28(  +  +) 

+  2.63(  +  ) 

-1-  I.32(-h 

+  ) 

Antisites 

Cation  (Te) 

-l-4.62(  -I- ) 

-(-3.94(.) 

-l-3.22(.) 

Te  (cation) 

+  4.59(  *  ) 

-1-  2.52{  ) 

+  0.89(  * 

) 
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taken  to  free  atom  states.  For  example,  for  the  anion  antisite 
defect  in  HgTe,  the  initial  state  is  bulk  HgTe  with  a  free  Te 
atom  and  the  final  state  is  bulk  HgTe  with  one  antisite  defect 
and  a  Hg  in  the  free-atom  state.  This  serves  as  a  convenient 
reference  from  which  other  energies  of  interest  can  be  easily 
obtained.  We  caution  against  a  direct  interpretation  of  the 
numbers  in  this  table,  because  the  relative  and  absolute  mag¬ 
nitudes,  as  well  as  the  sign  of  the  energies,  may  change  w  hen 
another  reference  state  is  chosen.  Perhaps  of  more  interest 
for  some  cases  is  when  the  impurities  are  taken  from  the  pure 
solids,  rather  than  the  free-atom  state.  This  reference  state 
was  calculated  for  several  of  the  impurities,  and  substitution 
energies  are  given  in  Table  II.  For  impurity  doping  during 
LPE  and  bulk  growth,  the  liquid  final  state  is  the  relevant 
one.  and  can  be  obtained  if  the  energy  of  the  atoms  in  the 
liquid  phase  are  known. 

A.  Cu,  Ag,  and  Au  impurities 

We  first  consider  a  situation  in  which  a  group  IB  metal 
(Cu,  Ag,  or  Au)  is  deposited  as  a  solid  on  the  surface  of  the 
II-Vl  compound  of  interest,  and  ask  which  has  the  lowest 
energy  for  substitution  on  the  cation  lattice.  From  Table  II 
we  find  that  Ag  has  the  lowest  substitution  energy  in  HgTe 
and  CdTe,  while  Cu  has  the  lowest  energy  for  ZnTe.  Implicit 
in  these  energies  is  the  assumption  that  the  replaced  cation 
escapes  to  the  vapor  and  equilibrium  is  achieved  (i.e.,  rates 
of  diffusion  are  not  relevant). 

The  above  ease  of  substitution  will  be  changed  if  the  impu¬ 
rity  is  being  introduced  from  the  vapor.  In  this  case,  Ag  will 
substitute  most  easily  in  ZnTe  and  CdTe.  while  Au  will  sub¬ 
stitute  most  easily  in  HgTe.  For  ZnTe  and  CdTe,  which  have 
relatively  wide  band  gaps,  the  zero-temperature  substitution 
energies  for  the  intrinsic  compound  may  be  altered  as  a  de¬ 
pendence  on  the  Fermi  energy  and  deep-state  energies  enters 
in  the  substitution  enthalpies. 

Although  the  above  conclusions  are  based  on  enthalpy 
consiaerations  alone,  the  conclusions  still  hold  if  the  three 
metals  are  compared  for  the  same  external  conditions,  i.e., 
temperature  and  Hg  pressure.  The  substitution  energies  are 
only  one  aspect  needed  to  understand  the  diffusion  of  these 

Table  II.  Total  energies  of  defect  formation.  Notation  is  A  (B),  where  A  is 
the  defect  and  B  is  the  defect  site.  For  Te  and  impurity,  the  pure  solid  is  used 
as  a  reference;  the  free-atom  state  is  used  as  the  final  state  for  the  Zn,  Cd,  or 
Hg  cation.  Energies  for  the  elemental  solids  are  taken  from  expenment. 


Total  energy  (eV) 


Defect 

ZnTe 

CdTe 

HgTe 

Cu  (cation) 

-f2.08 

+  2.51 

+  1.28 

Ag  (cation) 

-).2.67 

+  2.33 

+  1.03 

Au  (cation) 

-i-3.27 

+  2.78 

+  1.47 

A]  (cation) 

-f-2.35 

+  1.82 

+  0.37 

Ga  (cation) 

+  2.56 

+  2.04 

+  0.79 

In  (cation) 

+  3.32 

+  2.08 

+  0.59 

P  (Te) 

+  2.50 

+  2.53 

+  2.31 

As  (Te) 

+  1.87 

+  2.00 

+  1.57 

Sb  (Te) 

+  1.51 

+  1.64 

+  1.04 

P  (cation) 

+  4.88 

+  4.17 

+  2.87 

Sb  (cation) 

+  5.82 

+  4.26 

+  2.39 
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metals  into  the  compounds.  The  rates  of  diffusion,  as  well  as 
other  chemical  reactions  (e.g.,  formation  of  metal  tellu- 
ndes)  must  be  included  for  a  full  description  of  the  behavior 
of  these  metals  with  the  compounds. 


B.  Al,  Ga,  and  In  impurities 

Again  we  first  consider  the  case  where  Al,  Ga,  or  In  solid 
is  deposited  on  the  surface  and  ask  the  relative  ease  for  sub¬ 
stitution  into  the  compound  on  the  cation  sublattice.  From 
the  energies  in  Table  II,  we  find  that  Al  w  ill  substitute  most 
easily  on  the  cation  lattice  for  all  three  compounds:  ZnTe, 
CdTe,  and  HgTe.  In  CdTe  and  HgTe,  In — an  oft-employed 
dopant  in  HgCdTe — also  has  a  low  substitution  energy;  in 
ZnTe,  because  of  a  large  size  mismatch,  the  In  substitution 
energy  is  substantially  larger  than  that  of  Al  and  Ga.  Based 
on  these  results,  one  would  expect  the  doping  of  HgZnTe 
w  ith  this  class  of  impurities  will  be  more  difficult  than  in 
HgCdTe.  Because  all  of  the  energies  for  the  group  III  impur¬ 
ities  in  HgTe  are  less  than  the  respective  impurities  in  ZnTe 
and  CdTe,  we  predict  that  the  equilibrium  doping  in  HgTe 
(and  thus  HgCdTe  and  HgZnTe)  will  be  easier  than  in  the 
pure  compounds  CdTe  and  ZnTe. 

When  the  free-atom  final  states  are  considered,  the  group 
III  elements  are  predicted  to  be  stable  in  ZnTe,  CdTe,  and 
HgTe,  i.e.,  their  binding  energies  are  less  than  zero,  with  the 
exception  of  the  highly  strained  In  impurity  in  ZnTe.  For  the 
In  going  into  the  interstitial  sites  in  HgTe,  the  energies  are 
also  less  than  zero,  indicating  the  relative  stability  of  these 
sites  in  HgTe,  and  the  mercury-rich  alloys  HgCdTe  and 
HgZnTe.  Thus,  if  the  In  atoms  are  delegated  to  the  intersti¬ 
tial  sites  during  epitaxial  growth  (e  g.,  via  kinetic  processes 
on  the  growth  surface),  the  interstitial  sites  will  be  relatively 
stable  for  these  atoms. 

Consider  one  further  situation  in  which  the  group  III  ele¬ 
ment  is  incorporated  interstitially  into  the  compound,  either 
through  ion  implantation  or  during  epitaxial  growth.  We  ask 
then  how  much  energy  does  it  take  to  exchange  the  intersti¬ 
tial  with  an  atom  on  a  cation  lattice  site,  leaving  the  cation  in 
an  interstitial  site  and  the  impurity  on  a  lattice  site.  For  ex¬ 
ample,  for  In  in  HgTe,  we  consider  the  following  reaction; 

In, -f  HgHg-Inng  +  Hg,.  (1) 

The  energy  for  this  reaction  is 

E  =  —  £|n,,)  -f  £|n,Hg) 

=  -0.2eV.  (2) 

Although  this  reaction  is  found  to  be  exothermic  and  there¬ 
fore  is  predicted  to  proceed  based  on  total  energy  consider¬ 
ations  alone,  this  conclusion  is  not  within  the  accuracy  of  the 
present  calculation.  Additionally,  activation  barriers  may  be 
large  because  of  the  atom-swapping  process.  Additionally, 
the  equilibrium  with  the  other  native  defects  such  as  vacan¬ 
cies  and  antisites  must  be  considered  before  predicting  if  the 
reaction  will  proceed.  The  answer  in  any  particular  case  may 
well  depend  on  the  detailed  initial  state  of  the  solid,  e.g.,  the 
vacancy  concentrations,  dislocation  density,  or  carrier  type. 


C.  P,  As,  and  Sb  impurities 

Because  the  group  V  elements  are  slow  diffusers,’  it  is  just 
this  class  of  impunties  that  is  desired  to  achieve  impurity 
controlled  p-type  doping  in  HgCdTe.  We  find  that  the  Sb 
substitution  energy  is  smallest,  while  P  is  largest  in  all  three 
compounds.  The  differences  between  the  substitution  ener¬ 
gies  in  ZnTe,  CdTe,  and  HgTe  are  not  as  pronounced  as  was 
seen  for  impurity  substitution  on  the  cation  site  Substitution 
energies  for  P  and  Sb  on  the  cation  sublattice  have  also  been 
calculated.  Note  that  in  HgTe  the  substitution  energy  for  P 
does  not  differ  significantly  on  the  Hg  and  Te  sublattice. 
Additionally,  we  see  from  Table  I  that  the  interstitial  energy 
of  P  with  respect  to  solid  P  is  comparable  to  the  lattice  substi¬ 
tution  energies.  This  result  is  consistent  with  the  amphoteric 
behavior  of  this  element  in  HgTe,  observed  by  Vydyanath.*' 

We  also  note  that  the  substitution  energies  for  all  the 
Group  V  impurities  are  largest  in  CdTe.  This  suggests  that 
doping  may  be  more  difficult  in  HgCdTe  than  in  HgZnTe. 
This  conclusion  is  based  on  preliminary  calculations,  which 
indicate  that  the  substitution  energies  of  this  class  of  defects 
varies  linearly  with  composition  in  the  alloy. 

D.  Native  defects:  Accommodation  of  deviations 
from  stoichiometry 

Deviations  from  stoichiometry  can  be  accommodated  by 
a  number  of  native  defects:  aniisites.  interstitials,  vacancies, 
second  phase  precipitates,  and  extended  defects  (e  g.,  point 
defect  complexes,  dislocations).  As  discussed  by  Schaake 
and  Tregilgas.''  which  of  these  defects  dominates  is  deter¬ 
mined  both  by  equilibrium  considerations  and  the  rate  at 
which  equilibrium  is  established. 

We  considered  the  relative  enthalpies  for  accommodation 
of  a  specified  nonstoichiometry  by  the  first  four  defects 
named  above.  We  arbitrarily  choose  the  reference  state  as 
that  of  the  pure  compound  with  excess  Te  accommodated  by 
Te  solid.  With  respect  to  this  reference  state,  the  energy 
change  for  the  fourth  mode  of  accommodation  is  zero.  The 
relevant  reaction  to  accommodate  one  excess  Te  by  the  for¬ 
mation  of  an  Te  antisite  is 

Te,  — iTeHgTe^j.  (3) 

The  energy  for  this  reaction  is  given  by  0.5  (^TtiHg)  —  2£,, 
-)-  ),  where  Ej^  and  E^  are  the  absolute  magnitudes  of 

the  energy  per  atom  of  Te  solid  and  energy  per  bond  for  the 
HgTe  compound,  both  with  respect  to  the  free  atoms.  To 
accommodate  one  excess  Te  by  the  formation  of  a  Te  inter¬ 
stitial,  i.e., 

Te,-Te„  (4) 

an  energy  £Tei/(T«orHg)  i  +  ^Te,  necessary.  To  calculate 
the  energy  to  accommodate  the  nonstoichiometry  by  Hg  va¬ 
cancies,  we  use  the  experimental  values  for  the  Hg  vacancy- 
formation  energy  from  Vydyanath,'®  ^^(Hg)  =  2  2  eV,  and 
the  experimental  cohesive  energy  per  bond  of  0.82  eV  for 
HgTe.  The  relevant  equation  for  the  above  is 

Te,  —  I  Fn^Te^^, ,  (5) 

with  energy  given  by  f^Hgi  —  2£,,  +  E^^.  The  energies  for 
the  above  equations,  as  well  as  those  for  CdTe  and  ZnTe,  are 
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Table  HI.  Defect  energies  to  accommodate  excess  Te  (or  Hg,  Cd  or  Zn 
deficiency)  Energies  are  referred  to  the  pure  stoichiometnc  compound 
with  the  nonstoichiometry  accommodated  by  solid  Te,  and  are  per  excess 
Te.  The  two  values  shown  for  the  interstitial  correspond  to  the  cation-  and 
anion-imerstitial  sites. 


Native  defect 

Total  energy  (eV ) 

ZnTe 

CdTe 

HgTe 

Te  antisites 

-i-  3.21 

-(-  2.29 

-i-  1  75 

Te  interstitials 

-1-  4.82.  -  6.50  -r  3  67.  4  84 

-t-  3  57,  ^  3.57 

Canon  vacancies 

-)-  4.90 

-1-4  70 

-(-  2.78 

Te  precipitates 

0.00 

0.00 

0.00 

summarized  in  Table  III.  Vacancy-formation  energies  for 
CdTe  and  ZnTe  were  taken  from  experiment"  and  a  tight- 
binding  calculation,'’  respectively. 

Our  results  indicate  that  the  antisite  has  a  lower  enthalpy 
of  formation  than  the  vacancy  in  all  three  compounds,  and 
that  the  energy  for  the  interstitial  is  comparable  to  or  greater 
than  that  for  the  vacancy.  The  Te  antisites  and  interstitials 
both  exhibit  large  strain  energy,  an  estimate  of  which  has 
been  included  in  the  energies  in  Table  III.  These  strain  con¬ 
tributions  are  estimates,  and  thus  the  conclusions  reached 
here  must  be  viewed  as  somewhat  tentative.  The  configura¬ 
tional  entropy  contribution  to  the  free  energy  for  the  anti¬ 
sites  and  vacancies  are  the  comparable,  and  our  results  indi¬ 
cate  that  the  antisite  should  be  abundant.  If — as  expected  on 
simple  considerations — the  Te  antisite  is  a  donor,  it  could  be 
a  major  contributor  to  the  residual  donor  in  Hg  annealed 
samples.  If  both  vacancies  and  antisites  are  present,  compen¬ 
sation  will  occur  and  a  net  p-  or  n-type  material  can  result, 
depending  on  the  defect-state  structure  of  each  material. 
Finally,  the  Te  antisites  may  have  associated  with  them  deep 
levels  in  the  band  gap,  most  probably  donorlike.  These  may 
be  the  deep  states  at  0.4  £j,  and  0.7  where  is  the  band 
gap  energy,  present  in  undoped  HgCdTe,  which  appear  don¬ 
orlike.  "  We  are  currently  confirming  the  results  of  the  total 
energy  calculations  using  a  full-potential  LMTO,  in  which 
the  ASA  is  removed.  For  now,  we  conclude  that  the  antisite 
is  probably  more  abundant  than  previously  thought,  and 
that  it  may  be  the  source  of  the  donorlike  deep  states  in 
undoped  HgCdTe. 

E.  Heteroepitaxy  on  GaAs 

To  study  the  tendency  for  cross  doping  across  a  GaAs/II- 
VI  ep'layer  interface,  we  have  calculated  the  energies  for 
interchange  of  cations  or  anions.  For  example,  for  a  CdTe 
epitaxial  layer  on  a  GaAs  substrate,  we  must  consider  the 
reactions 


Cdcd  -l-  Ga^ja  — •  Cdoa  -1-  Ga^^ 

(6) 

and 

TCt,  -1-  As^^-Te.^,  -1-  Ast.- 

(7) 

Table  IV  summarizes  calculated  substitution  energies  for 
Zn,  Cd,  Hg,  and  Te  in  GaAs.  The  energies  of  the  above  reac¬ 
tions  for  ZnTe,  CdTe,  and  HgTe  on  GaAs  are  also  given  in 
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Table  IV  Total  energies  of  defect  formation  for  Zn.  Cd.  Hg.  and  Te  in 
GaAs  Reference  is  same  as  in  Table  I 


Total 

energy 

(eV) 

Defect/defect  reaction 

ZnTe 

CdTe 

HgTe 

Canon  1  Ga) 

^  2  14 

-1-  3.25 

-1-4.61 

Te  (As) 

-1-  2  92 

-t-  2.92 

-r  2  92 

Anion  exchange  [  Etj.  ( 7 ) ) 

-1-  1  89 

^  2  48 

+  2.59 

Cation  exchange  [Eq  (6) ) 

-r  3  93 

-e4.15 

-r  3.72 

Table  IV.  All  of  the  energies  for  the  interchange  reactions 
are  positive,  indicating  that,  based  on  the  enthalpy  contribu¬ 
tion  to  the  free  energy,  the  reactions  will  be  inhibited  (endo¬ 
thermic).  Thus,  the  enthalpies  of  reaction  will  not  tend  to 
drive  cross-doping  heteroepitaxy  in  this  system.  Because  the 
entropy  contributions  for  cation  and  anion  lattice  exchange 
are  of  the  same  form,  and  because  the  cation  energies  are 
smaller,  we  can  conclude  that  cross  doping  on  the  cation 
sublattice  will  be  more  dominant  than  on  the  anion  sublat¬ 
tice.  Note  that  the  calculated  enthalpies  do  not  include  the 
strain  expected  to  be  present  on  the  lattice-mismatched  in¬ 
terface. 


We  are  currently  developing  several  improvements  to  the 
present  work,  which  should  result  in  more  accurate  esti¬ 
mates  of  the  total  energies.  First,  the  lattice  relaxation  ener¬ 
gies  were  estimated  using  the  total  pressure  on  a  supercell, 
assuming  that  all  the  bonds  in  the  lattice  respond  uniformly, 
characterized  by  the  bulk  modulus.  Because  the  strain  is  lo¬ 
calized  in  the  bonds  nearest  to  the  misfit  impurity  or  intersti¬ 
tial,  the  bonds  nearer  the  defect  will,  in  fact,  relax  more  than 
those  far  from  the  defect.  We  are  currently  working  on  this 
problem.  Second,  the  identification  of  the  energy  position  of 
any  deep  states  produced  by  the  defects,  such  as  the  antisites 
ard  interstitials,  remains  a  challenge.  The  first  improvement 
to  the  current  work  is  the  implementation  of  a  Green’s  func¬ 
tion  approach  to  calculate  the  properties  of  truly  isolated 
defects  in  the  compounds.  Unfortunately,  until  the  source  of 
the  band  gap  problem  in  the  LDA  is  resolved,  at  most  one 
can  expect  from  such  calculations  is  trends  in  the  deep  levels, 
not  absolute  energies  with  respect  to  the  band  edge.  Finally, 
the  variation  of  substitution  energy  with  alloy  concentration 
needs  to  be  investigated  further.  Preliminary  calculations 
indicate  that  the  substitution  energies  vary  linearly  with  al¬ 
loy  composition. 

Several  conclusions  can  be  drawn  from  the  above  results. 

(i)  In  appears  relatively  stable  in  the  interstitial  sites,  per¬ 
haps  explaining  the  difficulty  in  obtaining  active  In  during 
epitaxial  growth,  where  nucleation  and  kinetic  effects  may 
favor  interstitial  incorporation  of  In. 

(ii)  Enthalpy  considerations  indicate  that  interstitital  In 
may  be  favored  to  exchange  with  Hg  on  lattice  sites. 

(iii)  P  and  Sb  energies  on  the  cation  and  anion  sublattice 
and  interstitial  sites  in  HgTe  are  all  comparable,  consistent 
with  the  experimentally  observed  amphoteric  behavior  of 
these  elements,  as  a  function  of  Hg  pressure. 


IV.  CONCLUSIONS  AND  FUTURE  WORK 
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( iv )  The  Te  antisite  is  predicted  to  be  more  abundant  than 
previously  expected  in  ZnTe,  CdTe,  and  HgTe.  The  Te  anti¬ 
site  may  be  responsible  for  the  donorlike  deep  states  at  0.4  E, 
and  0.7  in  the  HgCdTe. 

( V )  The  energy  to  remove  a  Hg  atom  from  a  lattice  site  to 
the  vacuum  is  smaller  than  to  remove  a  Hg  atom  from  a 
lattice  site  to  an  interstitial. 

(vi)  Bised  on  preliminary  calculations  of  the  alloy  vari¬ 
ation  of  the  substitution  energies,  we  predict  that  doping  of 
the  group  V  elements  will  be  easier  in  HgZnTe  than  in 
HgCdTe. 
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ABSTRACT 

We  review  our  recent  work  on  the  surface  binding 
energies  for  HgTe  and  CdTe  on  the  (111)  and  (HI) 
surfaces.  We  find  that  the  surface  binding  energies 
are  not  simply  proportional  to  the  number  of  sur&ce 
bonds  times  the  bulk  cohesive  energy  per  bond,  bu> 
rather  depend  on  the  ionicity  and  band  gap  of  the 
compound.  From  the  surface  binding  energies  for  an 
isolated  atom  and  an  atom  in  the  middle  of  an  island, 
we  have  deduced  an  effective  surface-dependent  pair 
interaction,  which  is  found  to  be  attractive  for  HgTe 
and  repulsive  for  CdTe.  Concequences  of  these  inter¬ 
actions  on  the  nature  of  the  growth  are  discussed. 


1.0  INTRODUCTION 

Molecular  beam  epitaxial  (MBE)  growth  of  HgCdTe  has  established  itsdf  as  an  impor¬ 
tant  method  for  the  fabrication  of  superlattices,  abrupt  hetero junctions  and  other  novel 
device  structures.  However,  growth  by  MBE  is  plagued  by  a  number  of  difficulties.  Mate¬ 
rial  growth  rate,  stoichiometry  and  quality  axe  determined  by  many  parameters,  including 
the  orientation  of  the  growth  surface,  substrate  temperatures  and  the  beam  fluxes.  In  ad¬ 
dition,  dopant  incorporation  and  activity  in  MBE  growth  differ  markedly  £n>m  traditional 
bulk  growth  from  the  liquid,  and  may  vary  with  substrate  orientation.  Optimization  of 
growth  parameters  in  the  absence  of  any  microscopic  insight  as  guidance  is  a  difficult  as 
well  as  an  unsatisfying  task.  Much  insight  has  been  gained  through  experimental  programs. 
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some  of  which  include  powerful  surface- analytic  techniques  such  as  RHEED.  Another  ap¬ 
proach  has  been  to  understand  the  MBE  growth  through  theoretical  models.  The  goal  of 
these  models  has  been  to  capture  the  essential  physics  of  this  admittedly  complex  process 
and  to  provide  guidance  into  the  effects  of  variations  within  some  experimental  parameter 
space.  An  important  input  to  these  models  is  the  binding  energies  of  the  constituent  atoms 
on  the  surface.  To  our  knowledge,  aU  of  these  models  assume  surface  energies  which  are 
calculated  by  multiplying  the  number  of  bonds  being  made  to  the  surface  times  the  bulk 
cohesive  energy  per  bond.  We  have  removed  this  assumption. 

In  this  paper  we  summarize  our  recent  work  on  surface  energies  for  growth  models  of 
the  constituent  compounds  of  HgCdTe,  HgTe,  and  CdTe,  on  the  (111)  and  (III)  surfaces. 
The  results  presented  here  have  been  published  previously zind  the  reader  is  referred 
there  for  further  detail. 


2.0  THEORETICAL  APPROACH 

We  begin  by  defining  several  surface  energies  of  interest  for  the  growth  of  a  compound 
AB.  Consider  the  (111)  surface  which  is  terminated  with  cations  (A)  which  ^lre  triply 
bonded  to  the  layer  below.  When  an  isolated  anion  (B)  bonds  to  this  surface  it  maices  one 
bond  to  the  surface.  We  refer  to  the  energy  needed  as  the  dilute  surface  binding  energy, 
£’^(111).  Similarly,  the  binding  energy  of  an  isolated  A  atom  on  the  (III)  surfaces  is 
£^(111).  Another  energy  of  interest  in  the  modeling  of  the  growth  process  is  the  energy 
of  binding,  not  of  an  isolated  atom,  but  rather  an  atom  in  the  middle  of  an  island  of  like 
atoms.  We  refer  to  this  as  the  binding  energy  in  the  concentrated  limit.  For  example  the 
binding  energy  of  a  singly  bonded  B  atom,  in  the  middle  of  a  B-atom  island,  to  the  (111) 
surface  is  denoted  by  £^(111).  Thus,  considering  these  two  surface  concentration  limits 
only,  we  calculate  eight  energies  for  each  compound  (including  the  energies  for  the  atoms 
that  are  triply  bonded  to  the  layer  below. 

From  the  dilute  and  concentrated  binding  energies  for  an  atom  on  a  given  surface,  we 
can  deduce  the  effective  surface-  pair  interaction  energy.  For  example,  by  comparing 

1.  Krishnamurthy,  S.,  M.A.  Berding,  A.  Sher,  and  A.-B.  Chen,  Physical  Review  Letters 
64,  2531  (1990). 

2.  Krishnamurthy,  S.,  M.A.  Berding,  A.  Sher,  and  A.-B.  Chen,  Accepted  for  publication 
in  the  Journal  of  Applied  Physics. 

3.  Berding,  M.A.,  S. Krishnamurthy,  A.  Sher,  and  A.-B.  Chen,  Journal  of  Applied  Physics 
67,  6175  (1990). 
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£■8(111)  and  £8(111),  we  note  that  both  energies  include  the  binding  to  the  underlying 
layer  and  their  difference  is  in  the  in-plane  bonds.  On  the  (111)  surface,  these  six  in¬ 
plane  bonds  are  second-neighbor  bonds,  and  the  in-plane  pair  interaction  energy  is  given 
by  .Ep  =  — ^  ■  We  have  calculated  the  surface  binding  energies  using  a  tight-binding 
Green’s  function  method  which  is  especially  suitable  for  systems  with  planar  G^'mmetry, 
such  as  superlattices  and  surfaces.  This  method  has  also  been  used  to  calculate  the  cleavage 
energies,  yielding  good  agreement  with  experiment^.  The  ideal  surface  Green’s  function 
is  perturbed  by  the  removal  or  the  addition  of  an  atom  to  the  surface  for  the  calculation 
of  the  concentrated  and  dilute  surface  energies,  respectively.  The  change  in  the  density  of 
states  resulting  from  this  perturbation  is  used  to  calculate  the  change  in  electronic  energy 
upon  removal  or  addition  of  an  atom.  Calciilational  details  are  given  elsewhere^ 

3.0  RESULTS 

Surface  binding  energies  for  the  (111)  and  (III)  surfaces  are  calculated  for  HgTe  and 
CdTe.  Results  are  given  in  Table  1.  Also  shown  ne  the  bulk  cohesive  energies  per  bond,  E^. 
From  the  dilute  and  concentrated  energies  an  effective  surface  pair  interaction  is  deduced 
and  these  energies  zire  also  summarized  in  Table  1. 

TABLE  1  :  SUREACE  BINDING  AND  PAIR  INTERACTION  ENERGIES 


Ed(111)  Ec(111) 

Ep 

EddH)  Ec(TTT) 

Ep 

eV 

eV 

-_2V _ 

eV 

eV 

eV 

CM 

HgTe 

Hg 

0.3 

-1.1 

CO 

CM 

O 

■ 

0.2 

-0.3 

-0.08 

•0.8 

Te 

-1. 8 

-3.8 

-0.33 

-2.8 

-2.8 

0 

-0.8 

CdTe 

ca 

-2.7 

-0.6 

0.35 

-2.2 

-1 .3 

0.15 

-1.1 

Te 

-3.9. 

-4.2 

-0.05 

-5.2 

-2.1 

0.52 

-1.1 

We  note  several  features  of  the  binding  energies  in  Table  1  and  their  concequences. 
First  we  see  that  the  surface  binding  energies  are  not  simply  proportional  to  the  number 
of  bonds  being  made  to  the  substrate.  This  is  evident  from  the  fact  that,  for  example, 
E^(lll)  ^  E^(lll)  1  •  Ej,  one  being  the  number  of  bonds  being  made  to  the  surface. 
Also,  as  expected,  the  magnitude  of  the  surface  binding  energies  is  larger  for  CdTe  than  for 
HgTe,  consistent  with  the  difference  in  bond  strengths  in  these  compounds,  and  the  anions 
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are  more  strongly  bound  to  the  surface  than  the  cations.  Furthermore,  we  note  that  in 
some  caises  Ej:>  Ec,  implying  that  m  isolated  atom  is  less  strongly  bound  to  the  surface 
than  an  atom  in  an  island,  which  is  what  one  might  expect  based  on  the  assumption  of 
attractive  second- neighbor  bonds,  while  in  other  cases,  E^  <  Ec,  implying  that  an  isolated 
atom  is  more  strongly  bound  to  the  surface.  The  former  czise  correspond  to  Ep  <0  and  an 
effective  attractive  surface-  pair  interaction,  and  the  latter  ca^e  to  Ep>  Q  and  an  effective 
repulsive  surface-pair  interaction 

The  surface  pair  energies  in  general  are  found  to  be  repulsive  for  CdTe  and  attractive 
for  HgTe.  These  results  were  explained^  from  differences  in  the  charge  uransfer  from 
cation  to  anion  surface  states,  which  is  more  pronounced  in  systems  with  larger  band  gaps. 
Furthermore  we  have  found  that  the  repulsive  energies  can  be  associated  with  more  ionic 
compounds  with  large  differences  between  the  amion-  amd  cation  derived  s\irface  states. 

The  difference  in  the  nature  of  surface  interactions  should  manifest  itself  in  the  nature 
of  growth  surfaces  at  low  temperatures.  By  low  temperatures  we  refer  to  temperatures 
less  than  some  criticaJ  temperature,  Te,  which  corresponds  to  some  surface  order-disorder 
transition.  At  temperatures  above  Tc,  entropy  will  dominate  and  for  surfau:es  with  both 
attractive  and  repulsive  effective  pair  energies  the  total  surface  will  be  largely  disordered. 
Below  the  critical  temperature  the  surface  will  be  largely  ordered,  consisting  of  islands 
of  atoms  when  Ep  <  0  and  consisting  of  ordered  arrangements  of  atoms  minimizing  the 
nximber  of  in- plane  atom  pairs  when  £p  >  0. 

4.0  CONCLUSIONS 

Although  the  above  discussion  leads  to  some  insight  into  the  different  behavior  of  the 
growth  surfaces  expected  for  HgTe  and  CdTe  for  T  <Tc,  the  energies  must  be  incorporated 
into  a  more  complete  growth  model  that  includes  the  effects  of,  for  ex2Lmple,  surface-atom 
mobility.  We  can  extrapolate  to  the  alloy  system  Hgi_xCdx'Te  to  deduce  the  nature  of 
the  surface-pair  interactions  there.  Because  all  infrared  applications  of  this  alloy  are  for 
low  X  values  and  thus  narrow  energy  band  gaps,  we  expect  the  surface  energies  to  be 
largely  attractive,  implying  layer  growth  through  island  formation.  Some  differences  can 
be  expected  between  the  two  cation  species  on  the  surfaces.  We  are  currently  investigating 
these  differences. 

This  work  has  been  supported  by  ONR  contract  N00014-88-C0096  and  NASA  contract 
NAS1-1S226. 
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Srinivasan  Krishnamurthy,  M.A.  Berding,  A.  Sher 

SRI  Iniernattonai,  Menlo  Park,  California  94025.  USA 


and 


A.-B.  Chen 

Physics  Depanmem,  Auburn  University.  Auburn,  Alabama  36849.  USA 


The  atomic  distributions  on  surfaces  of  Si,  GaAs.  HgTe  and  CdTe  are  studied  as  functions  of  temperature  and  substrate 
orientation  in  the  [1(X)).  (Ill)  and  (111]  directions.  Surface  entropy  is  calculated  within  the  quasichemical  approximation  and  the  pair 
interaction  energies  are  obtained  using  the  tight-binding  Green's  function  method.  In  most  cases  considered,  the  interactions  between 
the  atoms  are  attractive  and  they  tend  to  congregate  into  islands  for  submonolayer  coverage  at  temperatures  below  the  roughening 
transition  temperatures.  Free  energy  curves  for  double-layer  growth  are  presented.  We  find  that  growth  in  this  case  is  mostly  layer  by 
layer  as  is  often  observed  in  atomic  layer  or  molecular  epitaxial  growth.  However,  in  cases  such  as  Ga  tenmnated  (Til)  GaAs 
surfaces  and  most  CdTe  surfaces,  the  in-plane  interactions  are  predicted  to  be  repulsive.  The  calculated  order -disorder  transition 
temperatures  for  these  cases  are  often  much  larger  than  usual  growth  temperatures  and.  consequently,  incompletely  filled  surfaces  are 
expected  to  have  domains  in  which  atoms  and  vacancies  arrange  themselves  in  superlattice  patterns. 


1.  Introduction 

Properties  of  semiconductor  surfaces  during  ep¬ 
itaxial  growth  are  sensitively  dependent  on  the 
nature  of  the  interactions  between  surface  atoms. 
When  the  interaction  between  the  atoms  is  attrac¬ 
tive.  the  surface  grows  with  formation  of  islands  at 
temperatures  below  a  critical  temperature  known 
as  roughness  transition  temperature.  However, 
when  the  interaction  between  the  atoms  is  repul¬ 
sive,  the  surface  undergoes  an  order-disorder 
transition.  At  growth  temperatures  below  the 
order- disorder  transition  temperature,  surface 
atoms  and  vacancies  arrange  themselves  in  a  su¬ 
perlattice  pattern  that  minimizes  the  number  of 
atom-atom  pairs.  Irrespective  of  the  nature  of 
interaction,  the  sites  are  occupied  uniformly  at 
growth  temperatures  well  above  the  corresponding 
critical  temperature. 

Growth  by  low-temperature  epitaxial  methods 
such  as  atomic  layer  epitaxy  (ALE)  and  molecular 


beam  epitaxy  (MBE)  is  believed  to  take  place 
under  nonequilibrium  growth  conditions  [1-4). 
However,  when  the  surface  relaxation  rate  is  much 
larger  than  the  arrival  rate,  a  thermodynamic 
model  can  be  expected  to  qualitatively  describe 
the  morphology  of  the  growth  surface.  This  is 
because  surface  atoms  will  have  enough  time  to 
minimize  their  free  energy  before  the  arrival  of  the 
next  atom.  Statistical  models  v/ith  various  degrees 
of  approximations  are  frequently  used  to  study  the 
nature  of  the  surface  (5-14).  To  our  knowledge, 
none  of  these  studies  has  been  extended  to  con¬ 
sider  the  surface  arrangement  of  vacancies  in  a 
zinc-blende  lattice. 

In  this  paper,  a  thermodynamic  model  in  a 
quasichemical  approximation  (QCA)  with  pair  in¬ 
teractions  is  used  to  study  the  equilibrium  multi¬ 
layer  growth  surface.  Results  of  multilayer  growth 
of  a  model  cubic  lattice  are  given.  The  strength  of 
pair  interaction  on  various  ideal  semiconductor 
surfaces  is  obtained  from  various  surface  sublima- 
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tion  energies  calculated  using  the  tight-binding 
Green’s  function  method  [15,16].  The  pair  interac¬ 
tion  energies  are  then  used  in  this  thermodynamic 
model  to  obtain  the  minimized  free  energy.  The 
qualitative  nature  of  realistic  semiconductor 
surfaces  is  discussed. 


2.  Tbermodymic  model 

We  assume  that  total  enthalpy  of  the  surface 
can  be  written  as  a  sum  of  various  pair  energies. 
In  the  QCA,  we  assume  that  these  pairs  are  dis¬ 
tributed  randomly  [17].  Although,  in  most  cases, 
this  approximation  is  better  than  the  Bragg- Wil¬ 
liams  approximation,  it  still  overestimates  the  en¬ 
tropy.  The  roughening  transition  temperature  ob¬ 
tained  for  a  square  lattice  with  QCA  is  about  25% 
larger  than  the  exact  value  due  to  Onsagar  [5].  In 
addition,  because  the  correlation  between  the  pairs 
and  beyond  are  assumed  to  be  zero,  the  surface  at 
the  critical  temperature,  where  large  scale  fluctua¬ 
tions  exist,  is  not  expected  to  be  described  cor¬ 
rectly.  However,  epitaxial  growth  by  ALE  and 
MBE  are  normally  carried  out  at  temperatures 
well  below  the  transition  temperature,  and  the 
approximation  considered  here  will  remain  valid. 
Higher  order  approximations  with  corresponding 
cluster  energies  will  be  the  subject  of  future  work. 
The  results  obtained  using  QCA  with  pair  interac¬ 
tion  can  be  obtained  equivalently  with  Bethe’s 
approximation  [8].  However,  generalization  to 
multilayer  growth  problems  and  higher-order  ap¬ 
proximations  are  straightforward  with  QCA  en¬ 
tropy  expression  used  here. 

To  calculate  the  free  energy  within  the  QCA, 
first  we  identify  all  distinct  classes  of  pairs  that 
contribute  to  free  energy.  Then  write  the  prob¬ 
ability  for  the  occurrence  of  each  type  of  pair. 
Using  appropriate  factors  for  indistinguishability 
of  pairs,  the  entropy  for  different  surface  layers  is 
written  with  constraints  relating  various  pairs.  The 
free  energy  obtained  from  the  entropy  and  the 
interaction  energy  for  each  class  of  pairs,  is  mini¬ 
mized  to  get  most  probable  distribution.  In  our 
multilayer  model,  an  atomic  site  in  any  upper 
layer  can  be  occupied  even  if  the  site  immediately 
below  it  is  not  occupied.  Although  these  config¬ 


urations  are  energetically  less  favorable,  they  con¬ 
tribute  to  entropy  and  thus  they  should  be  allowed 
in  the  calculation  of  free  energy.  In  a  multilayer 
model,  we  consider  four  types  of  pairs  (viz.,  AA. 
AV.  VA.  W)  within  a  layer,  and  the  same  four 
types  of  pairs  with  the  layer  below,  where  A  and 
V,  respectively,  represent  atom  and  vacancy.  The 
number  of  pairs  in  each  of  these  classes  in  the  i  th 
layer  is  denoted  as  and  '  where  p  takes 
values  0,  1,  2.  or  3.  and  are  corresponding 
pair  energies  in  units  of  /cT^,  where  7^  is  the 
growth  temperature.  We  further  denote  the  ran¬ 
dom  probability  of  finding  respective  pairs  in  the 
ith  layer  as  >'^(0)  and  Zp(O).  tjq  and  tj,  are  the 
number  of  intra-  and  inter-plane  near  neighbors, 
respectively.  Then  the  change  in  free  energy,  ii/,. 
is 


(II 


where 


(y. 


N' 

- ^ - Af'  O' 

N.'.  (N-N,)'. 


n 
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Qy 


Q7'‘' 


yp  = 


Q'p-'' 


p  M  ' 
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Vo(0)  =  x^  y;{0)=y^(0)  =  x,(l-x,), 

yU0)  =  (l-x,f, 


zi(0)  =x,x,_,,  r;(0)  =x,_,(l -X,), 

zU0)  =  (l-x,)(l-x,_,), 


with  constraints 


23'o+>'i'  +^2  =  2^,-  2yj-t-y;  -l-y4  =  2(l  -xj. 

y{  =yi- 

Zq  -t-  z]  =  X,  _  , ,  Zq  +  Z'l  =  X, ,  Z3  Z|  =  1  —  X, , 
z^-f  zi  =  1  -x,_,. 
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4  I 

Note  that  for  every  layer  i,  we  introduce  nine 
variables  (viz.,  A/^,  Qy‘  ‘  and  jc,),  and  eight 
constraints.  Thus,  the  free  energy  has  to  be  mini¬ 
mized  in  one  additional  variable  for  every  new 
layer  added.  As  a  test  case,  we  can  use  this  model 
to  study  the  (1(X))  surface  of  a  mono-atomic  sim¬ 
ple  cubic  solid.  We  further  assume  that  atom-atom 
pair  energy  is  tg  and  all  other  energies  are  zero. 
Then  we  find  that  smoolh-to-rough  transition  oc¬ 
curs  at  the  value  of  0.74  for  kT^/c^.  In  addition, 
the  transition  temperature  changes  little  upon  in¬ 
clusion  of  interlayer  interactions.  This  is  in  agree¬ 
ment  with  that  obtained  by  Burton  and  coworkers 
[8],  who  calculated  the  transition  temperature  in 
Bethe’s  approximation.  Although  the  surface  de¬ 
composition  and  transition  temperature  change 
very  little  with  the  addition  of  a  layer,  the  finite 
values  obtained  in  our  calculation  of  the  number 
of  various  pairs  in  different  layers  will  provide 
information  on  surface  morphology  when  realistic 
crystal  structures  are  examined. 


3.  Pair  interaction  energies 

The  model  described  in  the  previous  section  has 
been  generalized  to  the  case  of  diamond  and  zinc- 
blende  lattices.  In  order  to  use  this  model  to  study 
realistic  semiconductor  surfaces,  we  need  the  pair 
interaction  energies.  These  energies  for  various 


surface  orientations  are  obtained  using  the  tight- 
binding  Green’s  function  (GF)  method.  Because 
the  details  of  calculational  procedure  have  been 
pubhshed  elsewhere  [15.16,18].  here  the  discussion 
will  be  limited  to  offer  only  continuity. 

We  define  surface  sublimation  energy  (SSE)  as 
the  difference  between  the  total  energy  of  a  surface 
plus  a  free  atom  and  that  of  the  surface  with  the 
atom  bonded  to  it.  In  this  convention,  a  positive 
SSE  means  that  atoms  will  require  energy  to  go 
from  surface  to  vapor.  The  calculation  of  SSE 
proceeds  in  two  steps.  First,  the  GF  for  an  ideal 
atomically  smooth  surface  is  calculated  in  terms  of 
bulk  GF.  using  Dyson’s  expansion.  Then,  the 
modified  GF  with  an  atom  added  to  or  removed 
from  the  surface  is  calculated  in  terms  of  surface 
GF.  The  change  density  of  states  introduced  by 
perturbations  is  used  to  obtain  the  electronic  en¬ 
ergy  contribution  to  SSE. 

The  perturbation  to  the  surface  that  involves 
the  removal  of  an  atom,  leaving  a  surface  vacancy 
behind,  is  referred  to  as  surface  sublimation  from 
concentrated  surface  and  the  corresponding  SSE  is 
denoted  as  E^.  This  removal  entails  breakjng  bonds 
with  the  layer  below  and  in-plane.  and  sometimes, 
second-neighbor  surface  bonds.  The  perturbation 
to  the  surface  that  involves  the  removal  of  an 
atom  thus  leaving  an  ideal  flat  surface  behind,  is 
referred  to  as  sublimation  from  dilute  surface  and 
the  corresponding  SSE  is  denoted  as  E^.  Here,  the 
removal  necessitates  breaking  bonds  only  with  the 
layer  below. 

In  this  model,  it  is  clear  that  the  effective 
atom- atom  pair  interaction  energy,  between  atoms 
on  the  surface  is  simply  (£^  -  E^)/i)q,  where  t/q  is 


Table  1 

In-plane  pair  interaction  energy.  <o  and  transition  temperature;  all  energies  are  in  eV;  number  of  bonds  broken  with  the  layer  below 
are  given  in  parentheses;  the  roughening  transition  temperature  (7^)  and  order-disorder  transition  temperature  )  are  in  kelvin 


Orien¬ 

tation 

Removed 

atoms 

Si 

GaAs 

CdTe 

HgTe 

*0 

7;(K) 

‘o 

r,(K) 

‘o 

7;(K) 

‘o 

rc(K) 

(111) 

a(l) 

0.35 

3700 

0.25 

2600 

0.05 

500 

.0.33 

3500 

c(3) 

0.37 

3900 

0.02 

200 

-0.35 

1400  * 

0.23 

2400 

(TTi) 

a(3) 

0.37 

3900 

0.07 

700 

-0.52 

2100  • 

0 

0 

c(l) 

0.35 

3700 

-C.IO 

410  • 

-0.15 

600  • 

0.08 

900 

(1(X)) 

a(2) 

0.48 

3200 

0.33 

2200 

-0.60 

4000  * 

0.18 

1200 

c(2) 

0.48 

3200 

0.25 

1700 

-0.50 

3300  • 

0.30 

2200 
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the  number  of  near-neighbor  surface  bonds.  A 
positive  (negative)  implies  an  attractive  (repul¬ 
sive)  interaction  between  surface  atoms.  Calcu¬ 
lated  values  of  Cq  are  given  in  table  1  for  (111), 
(111)  and  (100)  surfaces  of  Si,  GaAs,  CdTe  and 
HgTe.  Notice  that  the  (111)  surface  can  terminate 
either  with  triply  bonded  cations  or  with  singly 
bonded  anions.  The  op^posite  bonding  arrange¬ 
ment  is  found  in  the  [Ill]  direction. 

is  expected  to  be  larger  than  £j,  because  all 
dangling  bonds  in  the  concentrated  case  interact 
to  form  a  partially  filled  band  with  resulting  lower 
energy.  We  see  that  this  interpretation  explains 
the  trends  observed  in  Si,  HgTe  and,  in  most 
cases,  GaAs.  However,  in  all  other  cases  we  find 
the  in-plane  surface  interaction  to  be  repulsive. 
The  mechanism  that  drives  some  semiconductor 
orientations  to  have  £^.  smaller  than  has  been 
interpreted  to  be  the  charge  transfer  between  the 
surface  Fermi  level  and  the  newly  exposed  dan¬ 
gling  bond  states  (15).  Because  such  a  transfer  is 
large  in  wide-gap  ionic  compounds,  the  surface 
interactions  CdTe  and  ZnTe  are  predicted  to  be 
repulsive. 

4.  Results  and  discussion 

In  ALE,  grown  from  a  free  atom  vapor  phase, 
only  one  type  of  atom  species  (anion  or  cation)  is 
present  at  any  given  time.  The  atom-atom  pair 
energies  calculated  in  the  previous  section  can  be 
used  in  a  single-layer  growth  model  to  obtain 
minimized  free  energy  as  function  of  layer  cover¬ 
age,  X.  In  this  model,  all  other  pair  energies  are 
assumed  to  be  zero.  When  the  interaction  between 
the  atoms  is  attractive  («o>0),  the  surface  will 
undergo  a  smooth- to-rough  transition  as  the  grow¬ 
th  temperature  is  varied  near  the  critical  tempera¬ 
ture,  7^.  At  temperatures  below  7^,  the  growth  will 
take  place  with  the  formation  of  islands  and  atomic 
concentration  in  those  islands  will  be  given  by 
location  multiple  minima  in  free  energy  versus  x 
curve.  At  temperatures  well  below  T^,  the  minima 
are  expected  near  x  =  0  and  jc  =  1.  In  these  cases, 
islands  formed  on  the  surface  are  nearly  fully 
occupied  and  the  reminder  of  the  area  is  nearly 
empty.  As  the  surface  coverage  is  increased,  the 


islands  grow  in  size  with  a  fixed  vacancy  con¬ 
centration  until  the  layer  is  fully  grown. 

Because  the  (100)  surface  is  a  square  lattice, 
exact  value  of  0.57  t^/k^  given  by  Onsagar  [5]  is 
used  to  obtain  7^..  is  the  Boltzmann  constant 
For  the  hexagonal  lattice,  observed  in  (111)  orien¬ 
tation,  the  exact  value  of  is  0.91  t^/k^  [19). 
The  calculated  are  given  in  table  1  for  Si,  HgTe 
and  most  GaAs  surfaces.  Note  that  the  Tj.  is  much 
larger  than  the  typical  growth  temperature  for 
these  elements  and  compound  (except  for  (111)  Te 
terminated  HgTe  surface)  ALE  growth  of  these 
materials  is  dominated  by  formation  of  nearly 
perfect  islands. 

When  the  interaction  between  surface  atoms  is 
repulsive,  as  in  a  Ga-terminaied  (Ill)  GaAs 
surface  and  all  CdTe  surfaces,  the  free  energy  of 
eq.  (1)  always  remains  negative  with  one  mini¬ 
mum.  However,  these  surfaces  undergo  a  different 
kind  of  phase  transition  known  as  order-disorder 
transition.  At  temperatures  above  the  critical  tem¬ 
perature,  7^*.  the  surface  is  disordered  and  sites 
will  be  occupied  randomly  by  atoms  and  vacan¬ 
cies.  At  temperatures  below  7'^*.  the  atoms  and 
vacancies  on  the  surface  will  arrange  themselves  in 
domains  of  ordered  patterns  to  minimize  the  num¬ 
ber  of  atom-atom  pairs.  The  nature  of  the  super¬ 
lattice  arrangements  and  area  of  these  domains 
depend  on  the  growth  temperature  and  surface 
coverage.  For  example,  at  temperatures  well  below 
7^*  for  a  square  lattice  of  50%  coverage,  the 
superlattice  pattern  will  look  like  a  checker  board, 
where  every  near-neighbor  site  of  atom  (vacancy) 
is  occupied  by  vacancies  (atoms).  This  type  of 
arrangement  can  be  envisioned  for  a  mono-layer 
growth  in  (100)  orientation.  7"^*  calculated  using 
Onsagar’s  result  for  (100)  surfaces  and  that  using 
an  exact  value  of  0.35  (q/^b  (20]  for  (111)  orienta¬ 
tions  are  given  in  table  1.  We  see  that  the  calcu¬ 
lated  temperatures  are  well  above  the  typical  ALE 
or  MBE  growth  temperatures  for  these  com¬ 
pounds  (except  for  (111)  Ga  terminated  GaAs 
surface).  Hence  the  partially  filled  growth  surface 
is  expected  to  exhibit  superlattice  arrangements 
with  atoms  and  vacancies. 

In  MBE  growth  of  compounds,  both  types  of 
species  (anion  and  cation)  a>  c  present,  and  simul¬ 
taneous  growth  of  an  anion-ctiion  double  layer 
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can  be  expected.  We  extend  our  model  to  study 
the  double-layer  growth  of  semiconductors  with 
iinc-blende  lattice.  The  intralayer  pair  interaction 
energies  deduced  for  the  single  layer  growth  model 
.  re  still  valid.  The  inter-layer  anion-cation  pair 
interaction  energies  are  simply  where  tj,  is 

the  number  of  interlayer  near  neighbor  sites.  We 
define  an  anion  stabilized  growth  to  be  the  case 
where  relative  anion  and  cation  fluxes  are  such 
that  any  exposed  surface  atoms  are  anions.  When 
the  interaction  between  the  atoms  in  the  upper 
layer  and  that  in  the  lower  layer  is  strong  (com¬ 
pared  to  A.  7^),  the  surfaces  always  grow  in  the 
smooth  limit.  (100)  surfaces  of  HgTe  and  GaAs 
and  all  surfaces  of  Si  are  predicted  to  fall  into  this 
category.  However,  when  the  interaction  between 
adjacent  layers  are  not  always  strongly  attractive, 
as  in  the  case  of  (llT)  HgTe  and  (111)  GaAs 
surfaces,  the  nature  of  growth  depends  on  whether 
it  is  anion  or  cation  stabilized. 

The  free  energy  calculated  in  QCA.  at  a  growth 
temperature  of  185°C,  for  Te  and  Hg  stabilized 
(TIT)  surfaces  is  plotted  respectively  in  figs,  la 
and  lb  as  a  function  of  Te  and  Hg  surface  con¬ 
centration.  Each  corner  of  this  plot  represents  the 
surface  fully  covered  by  the  element  noted  there. 
In  ALE.  growth  takes  place  along  the  sides. 
Whenever  the  minimum  occurs  between  the  two 
corners,  that  surface  is  predicted  to  grow  in  the 
rough  limit  where  atoms  and  vacancies  randomly 
occupy  the  surface  sites.  Whenever  the  minima 
occur  near  the  comers,  the  surface  is  predicted  to 
grow  in  the  smooth  limit  with  formation  of  nearly 
perfect  islands.  In  MBE  where  both  species  are 
present  in  double-layer  stochastic  growth  of  com¬ 
pounds,  the  above  arguments  are  applied  to  the 
free  energy  curve  along  the  diagonal  line  connect¬ 
ing  the  origin  and  upper-right  comer.  Accord¬ 
ingly,  from  fig.  1,  we  see  that,  while  ALE  growth 
of  Hg  and  Te  layer  takes  place  in  the  smooth  and 
rough  limits  respectively,  MBE  growth  of  Te 
stabilized  surface  takes  place  in  the  smooth  limit. 
This  is  because  the  pair  energy  for  Te(  upper 
layer)- Hg(lower  layer)  is  strong,  0.9  eV  (15,16). 
However,  the  Hg  stabilized  surface  grows  in  the 
rough  limit  because  the  Hg  in  the  upper  layer  is 
weakly  attracted  to  Te  below  (nearly  0  eV  (15,16)) 
and  atoms  do  not  congregate  into  islands  to  maxi- 


Al 


Fig.  I.  Free  energy  for  (a)  Te  stabilized  and  (b)  Hg  stabilized 
(III)  surface  of  HgTe  at  ISS'C  as  a  function  of  Hg  and  Te 
surface  concentrations. 


mize  the  cation-anion  pairs.  Similar  behavior  is 
expected  in  the  growth  of  (111)  GaAs  surface. 

These  calculations  were  carried  out  only  for 
those  cases  where  the  interaclio  is  are  attractive. 
For  the  repulsive  case,  it  is  known  that  QCA  with 
pair  interactions  predicts  incorrect  phase  diagrams 
(21).  It  has  been  demonstrated  that  the  smallest- 
sized  cluster  to  get  a  correct  phase  diagram  con¬ 
tains  four  atoms  in  a  fee  lattice  (22).  Similar 
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calculations  with  cluster  variation  method  will 
have  to  be  carried  out  for  studies  of  most  CdTe 
surfaces. 


5.  Conclusions 

We  have  calculated  intralayer  and  interlayer 
atom-atom  pair  interaction  energies  using  a 
tight-binding  Green’s  function  method.  These  en¬ 
ergies  are  then  used  in  a  thermodynamic  model  to 
study  the  nature  of  growth  surfaces.  As  the  growth 
temperature  is  varied,  attractive  interaction  be¬ 
tween  surface  atoms  leads  to  smooth-to-rough 
transition,  and  Si,  HgTe  and  most  GaAs  surfaces 
are  calculated  to  fall  in  this  category;  the  repulsive 
interactions  lead  to  superlattice  ordered-to-dis- 
ordered  transition,  and  (IT!)  GaAs  and  most  CdTe 
surfaces  are  expected  to  be  in  this  category. 
Smooth  or  rough  surface  growth  is  found  to  be 
decided  by  inplane  interactions  in  ALE  growth 
and  by  interplane  interactions  in  double-layer 
MBE  growth.  In  our  calculations  of  pair  energies, 
we  considered  only  ideal  surfaces.  The  critical 
temperatures  and  other  conclusions  in  this  paper, 
can,  at  best,  be  expected  to  represent  the  trends. 
Nonidealities  such  as  reconstruction,  dimerization, 
relaxation  of  adatoms  and  surface  can  lower 
surface  energy  substantially  [23-26].  For  quantita¬ 
tively  more  accurate  values,  the  effects  of  these 
mechanisms  on  the  values  ol  (E^-  E^)  and  statis¬ 
tical  approximation  that  are  better  than  QCA  with 
pairs  will  have  to  be  included. 
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A  Green’s  function  method  is  used  to  calculate  the  removal  energies  of  constituent  atoms 
from  various  unreconstructed  semiconductor  surfaces.  An  efficient  difference  equation 
approach  within  the  second-neighbor  tight-binding  model  is  used.  For  a  compound  AB, 
binding  energies  for  the  A  and  B  atoms  on  the  (111),  (Ill),  (100),  and  (110)  surfaces  are 
calculated.  Energy  to  remove  an  atom  from  the  nearly  full  surface,  (where  the 
removed  atom  leaves  behind  a  surface  vacancy),  and  from  the  nearly  empty  surface, 

(where  the  removed  atom  was  isolated  on  the  surface),  is  obtained.  Results  are  presented  for 
Si,  GaAs,  CdTe,  and  HgTe.  The  surface  sublimation  energies  are  shown  to  depend  on 
surface  coverage  and  do  not  exhibit  a  simple  linear  relationship  to  the  number  of  bonds 
broken,  as  is  often  assumed  in  modeling  growth  by  molecular-beam  epitaxy  (MBE). 
Although  the  anion  and  cation  extraction  energies  depend  on  surface  coverage  and 
orientation,  when  averaged  over  a  double  layer,  they  always  sum  to  the  bulk  cohesive  energy 
Moreover,  E^  —  Ej  can  be  positive,  implying  effective  attractive  in-plane  surface  interactions, 
or  negative,  implying  effective  repulsive  interactions.  E^  —  E^  tends  to  be  positive  for 
covalent  and  narrow-gap  semiconductors,  and  negative  for  wide-gap  and  more  ionic 
semiconductors.  Surface  sublimation  energies  are  important  input  parameters  for  the 
modeling  of  MBE  growth;  their  importance  is  demonstrated  using  a  simple  thermodynamic 
growth  model  and  results  are  shown  to  explain  anomalies  found  in  MBE  growth  of 
HgCdTe. 


I.  INTRODUCTION 

In  recent  years  there  has  been  considerable  advance¬ 
ment  in  the  field  of  epitaxial  growth  of  semiconductors, 
with  molecular-beam  epitaxy  (MBE)  being  one  of  the 
most  important  techniques.  Modeling  of  the  MBE  growth 
process  involves  a  description  of  the  incoming  and  outgo 
ing  fluxes  of  atoms  from  the  surfaces,  as  well  as  in-surfac  ■ 
migration  and  reactions  among  the  constituents.  An  ii.i- 
portant  contribution  to  the  out-flux  arises  from  the  subli¬ 
mation  of  the  constituents  from  the  growth  surface.  At¬ 
tempts  to  determine  the  activation  energy  for  surface 
sublimation  processes  have  been  made  in  a  variety  of  ex¬ 
periments  on  GaAs'"'  and  CdTe.^®  Because  the  values 
obtained  depend  on  the  experimental  environmental 
conditions,’  the  reported  values  differ  substantially.  More¬ 
over,  the  experiments,  in  which  the  evaporation  of  many 
layers  is  permitted,  measure  quantities  related  to  bulk  co¬ 
hesive  energies  rather  than  to  the  surface-orientation- 
dependent  activation  energies  which  are  more  relevant  to 
growth  modeling.  Yet  in  the  modeling  of  MBE  growth, 
accurate  values  of  the  activation  energies  for  the  removal 
of  constituent  atoms  from  various  sites  on  a  given  surface 
to  the  vapor  are  essential.'®  "  In  this  paper,  we  calculate 
the  surface  sublimation  energy  (SSE)  for  various  classes  of 
sites. 

Several  theoretical  methods  such  as  cluster  methods, 
slab  methods,  and  Green’s  function  methods  have  been 
used  in  the  literature'^"”  to  study  semiconductor  surfaces. 
Both  the  cluster'"  '^  and  slab'’  '*  methods  approximate  the 


semi-infinite  bulk  terminated  at  a  particular  surface  by  a 
finite  number  of  atoms  or  two-dimensional  (2D)  atomic 
layers,  respectively.  While  first-principles  electronic  struc¬ 
tures  can  be  incorporated  into  these  methods,  doing  so  is 
extremely  time  consuming,  because  the  size  of  the  matrix 
involved  in  these  calculations  is  often  large  as  a  result  of 
the  range  of  the  surface  wave  functions.  Also,  the  calcula¬ 
tions  require  taking  differences  between  two  large  total  en¬ 
ergies.  The  present  method  differs  in  detail  from  other 
Green’s  function  methods'*^'’  advanced  for  the  study  of 
surfaces  in  that  we  take  full  advantage  of  the  symmetry  of 
the  surface  problem,  and  reduce  the  problem  to  an  efficient 
solution  of  a  difference  equation.^® 

In  this  paper,  we  use  a  tight-binding  (TB)  Green’s 
function  (GF)  method  to  calculate  the  SSE.  The  calcula¬ 
tion  of  the  SSE  proceeds  in  two  steps.  First,  the  GF  for  the 
ideal  flat  surface  is  calculated  in  terms  of  the  bulk  GF 
using  Dyson’s  expansion.  Second,  the  GF  for  the  surface 
with  an  atom  either  removed  from  or  added  to  the  ideal 
surface  is  calculated  in  terms  of  the  surface  GF.  Unlike  the 
c'uster  or  slab  methods,  the  size  of  the  matrix  equation  to 
be  solved  is  determined  by  the  size  of  the  perturbation 
Hamiltonian,  and  thus  exploiting  the  in-plane  periodicity 
of  the  unperturbed  surface,  the  resulting  matrix  is  compar¬ 
atively  small.  Additionally,  the  electronic  contribution  to 
the  total  energy  can  be  obtained  from  the  change  in  the 
density  of  states  (DOS)  which  is  calculated  directly  from 
the  unperturbed  GF  and  the  perturbation  Hamiltonian, 
thus  eliminating  problems  associated  with  calculating 
small  energies  by  taking  the  differences  between  two  large 
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energies.  The  DOS  is  calculated  as  a  continuous  function 
of  energy,  thus  treating  bulk  and  surfacelike  states  on  an 
equal  footing.  The  GF  is  calculated  using  a  novel  and  com¬ 
putationally  efficient  difference  equation  approach  devel¬ 
oped  recently  by  Chen.’° 

We  calculate  SSE  for  two  classes  of  cations  and  anions 
for  each  surface.  When  the  perturbation  to  the  surface  GF 
involves  the  removal  of  an  atom  from  an  ideal  surface, 
leaving  a  surface  vacancy  behind,  we  refer  to  the  process  as 
the  surface  sublimation  from  a  concentrated  surface.  The 
energy  required  to  remove  the  atom  in  this  limit  is  denoted 
E,:  When  the  perturbation  to  the  surface  GF  involves  the 
addition  of  an  atom  to  the  ideal  surface,  thus  beginning  a 
new  layer  of  atoms,  we  refer  to  the  inverse  of  this  addition 
process  as  the  surface  sublimation  from  a  dilute  surface, 
and  to  the  energy  as  E^.  Orientation-dependent  E^  and 
Ej  are  energies  required  in  modeling  MBE  growth. 

The  remainder  of  the  paper  is  organized  as  follows. 
The  method  of  obtaining  the  Green’s  functions  for  the 
bulk,  the  surface,  and  the  surface  with  an  additional  atom 
added  to  or  removed  from  the  surface  is  described  in  Sec. 
II.  In  Sec.  Ill,  calculated  bulk  and  surface  DOS  and  sur¬ 
face  sublimation  energies  of  the  constituent  elements  from 
(100),  (111),  (TIT),  and  (110)  surfaces  of  Si,  GaAs, 
HgTe,  and  CdTe  are  discussed.  To  illustrate  the  impact  of 
these  energies  on  growth,  they  are  incorporated  into  a  sim¬ 
ple  growth  model  in  Sec.  IV.  Concluding  remarks  are  given 
in  Sec.  V. 


II.  GREEN’S  FUNCTION  METHOD 

The  calculation  of  the  SSE  is  divided  into  two  pans. 
First,  the  surface  GF  is  calculated  in  terms  of  the  bulk  GF 
(which  is  formulated  in  terms  of  2D  Bloch  states).  The 
surface  GF  is  calculated  in  terms  of  the  bulk  GF  using  a 
difference  equation  technique.'*^  Second,  the  GF  for  the 
surface  with  an  atom  added  to  or  removed  from  the  surface 
is  calculated  in  terms  of  the  surface  GF.  The  surface  GF 
calculation  has  also  been  applied  to  the  calculation  of  the 
cleavage  energy,"'  and  the  following  calculational  proce¬ 
dure  for  the  surface  GF  also  appears  in  Ref  21;  it  is  in¬ 
cluded  here  for  completeness,  and  to  provide  the  ground¬ 
work  for  the  calculation  of  SSE, 

The  bulk  GF  is  calculated  as  follows.  Let  //g  be  the 
Hamiltonian  of  the  infinite  periodic  solid,  with  the  corre¬ 
sponding  eigenvalue  equation 


(Hb-E)\i!,}=0.  (1) 

For  the  present  problem,  it  is  convenient  to  decompose  the 
three-dimensional  lattice  into  identical  2D  planar  slabs, 
labeled  by  n,  with  their  orientation  determined  by  the  par¬ 
ticular  surface  to  be  studied.  Each  slab  is  in  turn  decom¬ 
posed  into  unit  cells,  described  by  real-space  lattice  vectors. 
In  parallel  to  the  slab  interface.  We  expand  the  wave  func¬ 
tions,  I  if’),  in  terms  of  2D  Bloch  states,  |n;ak||); 

|i/')=  S  X  c„(a)  |n;ak;|>.  (2) 

n  a 


Here  n  sums  over  the  planar  slabs  and  a  sums  over  the 
tight-binding  basis  functions  within  each  unit  cell.  The 
C„(a)  are  expansion  coefficients  to  be  determined  The 
Bloch  states  are  defined  by 

|n;ak„)=—  I  i«;al,,),  (3) 

\-Vi  I 

where  the  sum  is  over  the  two-dimensional  lattice  in  the 
nth  slab,  k  is  a  two-dimensional  Brillouin  zone  (BZ)  wave 
vector,  and  A',  is  the  number  of  unit  cells  in  each  slab. 

If  the  Hamiltonian  contains  only  nearest-neighbor  slab 
interactions,  then  Eqs.  ( 1 )  and  (2)  reduce  to  the  difference 
equation 

f^C„_,+AC„  +  EC„^,=0.  (4) 

where  the  intraplanar  and  interplanar  Hamiltonian  matri- 


ces  A.  F,  and  F*  are,  respectively. 

!  {HB-E)\n-a'k:X 

(5a) 

(5b) 

and 

(5c) 

The  general  solution  to  Eq.  (4)  is  given  by"° 


C„=  I  a/;/3;.  (6) 

j 

where  r^  and  Ej  are  the  yth  eigenvalue  and  corresponding 
eigenvector  of  the  following  characteristic  equation: 

{F-  +rA-¥r'F)!}  =  Q.  (7) 

The  Qj  of  Eq.  (6)  are  expansion  coefficients  determined  by 
the  boundary  conditions.  Letting  y=rP.  Eq.  (7)  reduces  to 
a  (2m  X  2m)  matrix  eigenvalue  problem 


-F~'A 


r]_  fr] 


(8) 


where  m  is  the  number  of  basis  functions  per  unit  cell.  We 
use  a  basis  set  of  one  s  and  three  p  wave  functions  on  eech 
atom;  thus  m  is  four  times  the  number  of  atoms  in  a  unit 
cell.  For  example,  for  a  zinc-blende  structure  with  planar 
slabs  chosen  parallel  to  the  (111)  plane,  the  slab  is  chosen 
to  contain  one  cation  and  one  anion  layer;  thus  m=8.  The 
bulk  GF,  C®,  can  be  obtained  similarly  by  solving  the  equa¬ 
tion 


{Hb-E)G^=-I,  (9) 

where  //^  is  the  bulk  Hamiltonian  and  I  is  the  unit  matrix. 
The  corresponding  matrix  equations  in  the  planar  Bloch 


basis  are 

(10a) 

and 

7^gL,,„  +  /1G1,  +  F’G®.,,„=0, 

(10b) 

where  the  subscripts  on  G  are  layer  indices.  From  the  the 
general  solution  to  Eq.  (10),  the  GF  of  interest  can  be 
written  as 
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G‘„=\F{QXQ-'-PRp-')]-', 
'Gf,.  m>n. 


(18a) 


(lla) 

(llb) 

and 

Gl,„  =  QX'”-''Q-'G^„„  m<n,  (11c) 

where  X  and  R  are  diagonal  matrices  containing  eigenval¬ 
ues  of  magnitude  greater  than  and  less  than  unity,  respec¬ 
tively,  and  Q  and  P  are  the  corresponding  eigenvector  ma¬ 
trices  obtained  from  Eq.  (8).  Finally,  in  the  usual  manner, 
the  density  of  states  (DOS)  can  be  calculated  from 


\g1i{E)—  2  Gfr(E)yfxASQ.u^u.rGfA£).  n^O, 

r.r 

and 

Ago.o(^)  =  (^-^"o.o  I  ^o.rGf.r(^)>"r,o)  (18b) 

Note  that  subscripts  for  Green’s  functions  in  Eq.  (18)  are 
now  site  indices  for  a  particular  layer,  and  n  is  the  layer 
containing  site  /.  The  corresponding  change  in  DOS  is  then 
simply  given  by 


p^(£)=— ImTrf^  I  Gf,„(k,,.£  +  /T,)V  (12)  Ap/(£)  =— ^  Im  Trf  I 

^  k||  j  ^  \  n 


y 

I 


(19) 


where  77  is  vanishingly  small. 

From  the  bulk  layer  GF,  the  surface  GF  is  obtained 
using  Dyson's  expansion.  The  surface  is  created  by  the 
bond-cutting  method,  in  which  the  interactions  between 
two  semi-infinite  bulks  are  set  to  zero.  Hence,  the  Hamil¬ 
tonian,  Hs,  for  the  two  semi-infinite  bulks  separated  be¬ 
tween  planes  0  and  1,  is  given  by 

Hs=Hb-F-F^.  (13) 

Then  the  intraplanar  surface  GF,  Gf  „,  is  given  by 

=  G;?,o£*(  1  +  Gfo£^)  -  'Gf,„.  ( 14) 

Finally,  the  surface  DOS  is  given  by 

p5(£)=^ImTr^^  I  Gf,„(k|„£ -f /ij)  j  ,  (15) 

and  the  corresponding  change  in  DOS 

pc(£)=I  (Ps-Ps)-  (16) 

/> 

In  Eq.  ( 16),  the  summation  over  all  layers  n>/  is  found  to 
converge  rapidly  away  from  the  surface.  Note  that  in  the 
calculation  of  ps  both  bulklike  and  surface-localized  states, 
which  show  substantial  dispersion  as  a  result  of  the  in¬ 
plane  periodicity,  are  treated  on  an  equal  footing. 

SSE  values  are  calculated  for  both  the  dilute  and  con¬ 
centrated  surface  coverage,  as  defined  in  Sec.  I.  For  the 
dilute  surface,  a  constituent  atom  is  placed  on  the  surface 
with  bond  lengths  and  angles  corresponding  to  the  bulk 
equilibrium  values.  Only  first-neighbor  interactions  of  the 
adatom  with  the  surface  atoms  are  included,  a  good  ap¬ 
proximation  for  covalent  semiconductors.  The  full  Hamil¬ 
tonian  for  the  surface  with  an  isolated  adatom,  H],  can 
then  be  written  as 

H/=Hs+  X  (^0.0+  ^o,/+  ^/.o)>  (J"^) 

I 

where  j-  is  the  matrix  element  between  sites  /'  and  /'. 
The  sum  goes  over  all  nearest-neighbor  sites,  /,  of  the  atom 
added  at  site  0.  The  change  in  the  GF  resulting  from  the 
addition  of  an  atom  to  the  surface  is  given  by 


where  the  sum  is  over  all  layers  n.  and  is  found  to  converge 
rapidly  away  from  the  surface.  The  sum  over  all  sites 
within  a  layer,  n.  can  be  obtained  exactly  by  writing  the 
local  GF  in  terms  of  Bloch  Green's  functions  of  Eq.  (14) 
and  computing  the  two-dimensional  BZ  integration.  The 
change  in  DOS  is  given  by 


Ap/(£)= - lmTrjAgoo+  X  TT  X 


'5  k, 


C^./(ki,£) 


X 


(20) 


where  we  have  reintroduced  the  slab  GF.  Because  we  are 
interested  in  the  removal  of  an  isolated  atom  from  the 
surface,  the  negative  of  Eq.  (20)  enters  into  the  calculation 
of  the  SSE  for  a  dilute  surface. 

In  the  other  extreme  case,  in  which  an  atom  is  re¬ 
moved  from  a  nearly  full  surface  (the  concentrated  limit), 
the  surface  sublimation  energy  is  obtained  by  letting  the 
site  diagonal  matrix  elements  of  the  atom  at  site  /=0  go  to 
infinity.  For  this  perturbation,  the  change  in  the  DOS  is 
given  by 


Ap/(£)=— lmTr(  X 

TT 


1  d  , 

=  - Irn  Tr— [In(G^o)].  (21) 

TT  or, 


ill.  SURFACE  SUBLIMATION  ENERGIES 

We  define  surface  sublimation  energy  as  the  difference 
between  the  total  energy  of  a  surface  plus  an  isolated  free 
atom  and  the  surface  with  that  atom  bonded  to  it.  In  the 
TB  model,  the  total  energy  is  written  as  the  sum  of  the 
electronic  energy  of  the  occupied  state  and  the  repulsive 
energy  Fq  between  the  nearest-neighbor  pairs.^^  The 
change  in  the  electronic  energy  is  calculated  from  the 
change  in  DOS,  Eqs.  (20)  and  (21)  above,  via 

A£.,,=  Ap,iE)dE.  (22) 
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The  integral  over  energy  extends  to  the  highest  filled  state, 
E„,  which  is  detenrined  by  the  charge  neutrality  condition 
that 

2  =(!"'”  Ps^E-  =  (23) 

where  is  the  highest  filled  bulk  state  (the  valence-band 
edge  in  our  zero-temperature  calculations)  and  +  or 
—  sign  has  to  be  used,  respectively,  for  cation-  or  anion- 
terminated  polar  surface.  The  quantity  A  is  detem.ined  so 
that  there  is  no  net  potential  due  to  alternating  parallel 
planes  of  charged  anion  and  cation  layers  in  the  case  of 
polar  surfaces.  A  is  (Q^— 3)/4  per  dangling  bond  per  unit 
cell,  where  is  the  net  charge  on  the  cation  in  the  bulk. 
Notice  that  A  is  zero  for  a  (1 10)  surface  because  the  num¬ 
bers  of  anion  and  cation  dangling  bonds  on  this  surface  are 
equal.  will  in  general  depend  on  the  particular  surface 
and  may  differ  for  the  complementary  anion  and  cation 
termination  of  a  polar  surface.  For  an  ideal  cation- 
terminated  surface.  E„  will  be  at  the  energy  level  corre¬ 
sponding  to  the  cation  dangling  bond,  which  is  normally 
near  the  conduction-band  edge.  Similarly,  for  an  ideal 
anion-terminated  surface,  will  be  at  the  energy  level 
corresponding  to  the  anion  dangling  bond,  which  is  nor¬ 
mally  near  the  valence-band  edge.  The  bond  length  and  the 
interatomic  and  intraatomic  matri.x  elements  for  the  atoms 
on  the  surface  are  assigned  their  bulk  values,  and  only 
nonreconstructed  surfaces  are  considered.  This  calculation 
can  be  easily  e.\tended  to  account  for  bond-length  relax¬ 
ation  and  surface  reconstruction  by  using  a  larger  unit  cell. 
While  these  approximations  limit  our  accuracy,  it  is  the 
first  step  toward  evaluating  these  SSE  realistically. 

The  change  in  DOS  due  to  creation  of  a  surface  is 
calculated  for  the  anion-  and  cation-terminated  (111), 
(ITT),  ( 100),  and  (110)  surfaces  for  elemental  silicon  and 
for  GaAs,  CdTe,  and  HgTe  compounds.  In  all  the  calcu¬ 
lations,  second-neighbor  tight-binding  Hamiltonians  with 
four  orbitals  per  atom  are  used  to  obtain  the  bulk  elec¬ 
tronic  structures. Slabs  containing  two  atomic  layers 
are  defined  for  each  surface  and  the  resulting  size  of  the 
GF  matrix  is  8  X  8  for  the  ( 1 1 1 )  and  ( 100)  surface  orien¬ 
tations.  In  order  to  hold  the  interplanar  coupling  to  a  first- 
neighbor  layer,  the  size  of  the  basis  for  the  ( 1 10)  direction 
is  increased  to  contain  four  atoms  per  unit  cell,  resulting  in 
a  16x16  matrix.  An  analytical  continuation  method’*’  is 
used  to  simplify  the  2D  BZ  integration. 

As  an  example,  we  present  detailed  results  for  the 
(100)  and  (110)  surfaces  (cleavage  surfaces)  of  GaAs, 
and  genera)  results  for  the  SSE  of  Si,  GaAs,  CdTe,  and 
HgTe.  The  bulk  DOS  for  GaAs,  obtained  from  Eq.  ( 12),  is 
shown  in  Fig.  1.  The  DOS  calculated  using  the  difference- 
equation  approach  presented  above  is  in  excellent  agree¬ 
ment  with  that  obtained  directly  from  the  band  structure. 
The  change  in  DOS  between  the  surface  and  bulk,  calcu¬ 
lated  from  Eq.  (16),  is  shown  in  Fig.  2  for  (a)  an  anion- 
terminated  and  (b)  a  cation-terminated  (100)  surface  of 
GaAs.  The  layer  DOS  was  found  to  converge  to  the  bulk 
value  by  the  fifth  slab  and  thus  the  layer  sum  in  Eq.  ( 16) 
has  been  carried  out  over  only  the  first  four  slabs  from  the 


energy  (eV) 


FIG  )  Bulk  DOS  for  GiiAs  in  ihe  units  of  states/ electron  volt  cell 
Energy  is  measured  from  the  top  of  the  valence  band 

surface.  In  Fig.  3,  the  change  in  DOS  between  the  surface 
and  bulk  in  the  vicinity  of  the  band  gap  for  the  (110) 
GaAs  cleavage  plane  is  shown.  Note  that  two  states  are 
introduced,  an  anionlike  state  at  the  valence-band  edge  and 
a  cationlike  state  in  the  midgap  region.  The  cleavage  ener¬ 
gies  calculated  from  the  difference  between  the  bulk  and 
surface  DOS  for  the  ( 1 10)  surface  for  GaAs  and  the  ( 1 1 1 ) 
surface  for  silicon  are  found  to  be  in  good  agreement  with 
experiments.’'  Finally,  the  change  in  DOS  due  to  the  re¬ 
moval  of  an  isolated  anion  or  cation  from  the  GaAs  ( 100) 
surface  in  the  dilute  limit  is  shown  in  Figs.  4  (a)  and  4(b), 
respectively. 

The  change  in  electronic  energy  of  the  system  due  to 
the  removal  of  an  isolated  atom  from  the  surface  is  calcu¬ 
lated  from  Eq.  (22),  with  Ap/(£')  given  by  Eq.  (20).  If  the 
repulsive  energy  gained  by  breaking  a  bond  is 
face  sublimation  energy  for  an  atom  in  the  dilute  limit  is 
given  by 

Ed=^E,^,  +  (24) 

Here  /i*  is  the  number  of  bonds  the  adatom  makes  with 
the  surface  and  is  the  electronic  energy  of  the  free 
atom.  Vq  is  chosen  so  as  to  yield  the  correct  bulk  cohesive 
energy  of  0.82,  1.03,  1.63,  and  2.32  eV,  respectively,  for 
HgTe,  CdTe,  GaAs,  and  silicon.  The  SSE  for  an  atom  from 
the  concentrated  limit  is  calculated  in  a  similar  way,  but 
with  Ap/(£)  in  Eq.  (22)  replaced  by  Eq.  (21). 

SSEs  of  anions  and  cations  from  various  semiconduc¬ 
tor  surfaces  in  silicon,  GaAs,  CdTe,  and  HgTe  are  given  in 
Table  I  We  note  several  features  of  the  SSEs:  ( 1 )  the  SSEs 
are  sensitively  dependent  on  the  crystal  orientation;  (2)  in 
general,  they  do  not  vary  linearly  with  the  number  of  bonds 
being  made  to  the  surface;  (3)  the  SSEs  for  cations  and 
anions  differ  and  are  orientation  dependent;  and  (4)  they 
vary  considerably  between  the  dilute  and  concentrated 
limit,  even  for  the  (111)  and  (100)  surfaces  where  there 
are  no  first-neighbor  in-plane  bonds.  The  first  two  points 
are  extremely  important  to  the  modeling  of  MBE  growth. 
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FIG.  2.  (a)  Difference  between  the  surface  DOS  and  bulk  DOS  for  the 
arsenic-terminated  (100)  surface  for  GaAs.  (b)  difference  between  the 
surface  DOS  and  bulk  DOS  for  the  Ga-terminated  (100)  surface  for 
GaAs.  In  Figs.  2-4,  the  charge  in  DOS  is  the  number  of  states  per  elec¬ 
tron  volt  per  cell  summed  over  all  layers. 


where  a  linear  dependence  of  the  surface  bonding  energies 
with  the  number  of  bonds  being  made  is  commonly 
assumed.^’  Because  the  SSEs  do  not  exhibit  this  simple 
linear  variation  with  the  number  of  bonds  being  made  to 
the  surface,  the  growth  properties  based  on  these  energies 
are  expected  to  differ  substantially  from  those  predicted 
using  a  linear  variation  (Sec.  IV).  Although  the  energy 
required  to  remove  a  cation  or  anion  differs  from  that  de¬ 
duced  from  the  bulk  cohesive  energy,  we  find  that  average 
energy  per  bond  for  the  removal  of  an  anion  plus  a  cation 
layer  equals  the  bulk  cohesive  energy,  as  it  should. 

Intuitively,  one  would  exp>ect  >  E^.  The  removal  of 
an  atom  from  the  concentrated  limit,  in  addition  to  break¬ 
ing  interlayer  bonds,  requires  the  breaking  of  in-plane  sur¬ 
face  bonds  which  are  first-neighbor  bonds  for  the  (110) 
surface  and  second-neighbor  bonds  for  the  other  surfaces. 
Even  in  the  absence  of  first-neighbor  bonds  on  the  surface, 
the  surface  dangling  bonds  in  the  concentrated  limit  inter¬ 
act  to  form  a  partially  filled  band,  which  lowers  energy. 


FIG.  3.  Difference  between  the  surface  DOS  and  bulk  DOS  near  the  band 
gap  for  the  ( 1  !0)  surface  for  GaAs. 

When  a  surface  atom  is  removed,  the  removal  of  an  elec¬ 
tron  from  these  broadened  surface  states  will  in  general 
require  more  energy  than  if  the  dangling  states  were  iso¬ 
lated.  as  in  the  dilute  case.  As  seen  from  Table  I,  this 
conventional  interpretation  explains  the  trends  observed  in 
silicon,  HgTe.  and  most  cases  in  GaAs.  However,  for  other 
cases,  we  find  that  is  smaller  than  E^. 

In  order  to  understand  the  mechanisms  that  drive 
some  semiconductors  to  have  E^  <  we  examine  the  na¬ 
ture  of  the  anion  and  cation  surface  states.  Notice  that 
Ef  <  E^  occurs  only  for  pyolar  surfaces  in  semiconductors 
with  a  large  band  gap. 

As  an  illustration,  we  choose  the  (TIT)  surface  of 
CdTe  where  E^  is  less  than  E^.  The  surface  density  of  states 
for  two  ideal  surfaces,  the  Cd-  and  Te-terminated  (TIT), 
are  shown  in  Fig.  5.  For  an  unreconstructed  surface,  some 
of  the  surface  states  normally  lie  in  the  fundamental  gap. 
Compounds  have  both  cation-  and  anion-derived  hybrid 
surface  states,  where  the  cation-derived  most  often  lie 
higher  in  energy.  The  energy  separation  between  the  peaks 
in  the  DOS  in  the  band-gap  region  is  related  to  the  energy 
difference  between  hybrid  states,  and  thus  the  peak  sepa¬ 
ration  is  larger  in  the  II-VI  than  the  III-V  compounds. 

Now  let  us  consider  the  removal  of  a  Cd  atom  from  a 
(ITT)  surface  in  the  concentrated  limit.  The  highest  occu¬ 
pied  surface  level  E„  is  at  the  cation  surface  state  energy, 
1.25  eV  above  the  valence-band  maximum.  When  a  singly 
bonded  cation  is  removed  from  the  surface,  a  surface  state 
from  the  anion  previously  bonded  to  the  removed  cation  is 
created,  and  three  cation  surface  states  are  destroyed.  TTie 
surface  density  of  states  for  the  isolated  anion  surface  state 
created  in  this  process  is  similar  to  that  shown  by  dashed 
line  in  Fig.  5  for  the  pure  anion-terminated  surface,  with 
only  minor  differences  in  the  widths  and  heights  in  the 
peaks.  Because  this  anion  surface  state  lies  near  the  top  of 
the  valence  band,  the  electrons  from  the  cation  surface 
states  will  transfer  into  this  level.  This  process  reduces  the 
energy  of  the  final  configuration,  and  E^  for  removal  of  a 
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FIG.  4.  (a)  Change  In  the  OOS  due  to  the  removal  of  an  isolated  gallium 
atom  from  an  arsenic  ( 100)  surface  (b)  Change  in  the  DOS  due  to  the 
removal  of  an  isolated  arsenic  atom  from  a  gallium  ( 100)  surface. 


cation  in  this  concentrated  limit  is  correspondingly 
reduced.  In  the  present  case,  the  difference  in  the  reduction 
in  energies  due  to  this  charge  transfer  effect  outweighs  the 
surface  state  broadening  effect,  resulting  in  <  Ej.  How¬ 
ever,  for  an  isolated  cation  removed  from  the  (TTT)  anion- 
terminated  surface,  E„  is  at  the  top  of  the  valence  band, 
which  is  very  close  to  the  newly  exposed  anion  surface 
state.  Thus,  in  contrast  to  the  concentrated  limit,  there  is  a 


FIG  5  Density  of  dangling  hybrid  states  of  anions  (dashed)  and  cations 
(solid)  for  ( 1 1 1  )S-onenled  CdTe  £„  is  the  energy  of  the  highest  occu¬ 
pied  level  for  a  cadmium-terminated  ( 1 1 1  )£  surface. 


negligible  reduction  in  E^  due  to  the  charge  transfer  effect. 
Note  that  these  charge  transfer  processes  are  reversed  for 
surface  sublimation  of  anions  from  the  (TTT )  surface.  Since 
three  bonds  are  broken,  more  electrons  are  involved  in  the 
reversed  spillover  and  the  difference  between  E^  and  E^  is 
correspondingly  larger. 

We  conclude  that  the  charge  transfer  from  the  cation 
to  anion  surface  states  Wi.i  increase  E^  and  decrease  E^. 
and  thus  always  reduces  the  difference,  E^—E^.  The  mag¬ 
nitude  of  the  charge-transfer  effect  depends  on  the  amount 
of  charge  transferred  and  the  separation  between  the  rele¬ 
vant  energy  levels.  Whenever  this  effect  is  substantial. 
can  become  smaller  than  E^- 

In  an  elemental  semiconductor  like  silicon,  no  transfer 
is  expected  and  Ec  is  always  expected  to  be  larger  than 
£rf,  as  is  observed.  In  GaAs,  although  a  charge  transfer  is 
present,  the  energy  difference  between  the  E„  and  the 
newly  exposed  dangling  bond  state  is  very  small,  resulting 
in  a  reduced  charge  transfer  effect.  As  a  consequence, 
E^—Ej  is  reduced  and,  in  general,  remains  positive.  Owing 
to  an  increased  ionicity  in  II-VI  compounds,  the  energy 
separation  between  the  dangling  states  is  large.  As  illus¬ 
trated  above,  the  effect  of  the  charge  transfer  is  substantial 
in  CdTe  and  in  most  cases  causes  E^  to  be  less  than  Ej. 
Although  HgTe  is  a  II-VI  compound,  the  charge  transfer 
does  not  occur  because  it  is  a  semimetal  with  no  forbidden 


TABLE  I.  Orientation-dependent  SSE  and  the  highest  occupied  level  (measured  with  respect  to  the  top  of  the  valence  band).  All  energies  are  in  electron 
volts.  7)  is  the  number  of  bonds  wiith  the  layer  below  that  are  broken. 


Orientation 

Removed 

atoms 

V 

Si 

GaAs 

CdTe 

HgTe 

K 

£, 

£c 

E, 

(III) 

a 

1 

0.0 

2.5 

4.6 

0.4 

2.3 

3.8 

0.7 

3.9 

4.2 

0.0 

1.8 

3.8 

c 

3 

0.0 

4.2 

6.4 

0.55 

3.2 

3.3 

1.6 

2.7 

0.6 

0.0 

-0.3 

1.1 

(11  1  ) 

a 

3 

0.0 

4.2 

6.4 

0.0 

4.6 

5.0 

0.0 

5.2 

2.1 

0.0 

2.8 

2.8 

c 

1 

0.0 

2.3 

4.6 

1.3 

3.3 

2.7 

1.6 

2.2 

:.3 

0.0 

-0.2 

0.3 

(100) 

a 

2 

0.0 

3.7 

5.6 

3.4 

4.7 

0.7 

3.0 

2.6 

0.0 

2.4 

3.1 

c 

2 

0.0 

3.7 

5.6 

2.6 

3.6 

1.6 

2.1 

0.1 

0.0 

-0.2 

1.0 
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gap.  The  dangling  states  for  HgTe  are  resonant  in  the  band 
and  E„  is  always  at  the  conduction-band  edge.  Because  the 
cation  surface  states  reside  in  the  conduction  band,  they 
are  never  occupied  and  consequently  a  charge-transfer  ef¬ 
fect  is  not  present.  Thus  for  HgTe.  £<.  is  always  larger  than 
Ej.  In  general,  the  reduction  in  the  difference  of  £<.  and 
Ej  due  to  the  charge  transfer  is  expected  to  be  large  for  all 
wide-gap  II-VI  compiounds. 

The  relative  magnitude  of  the  surface  energies  can 
have  profound  consequences  on  the  growth  properties  of 
the  semiconductor  crystals.  From  the  difference  between 
Ec  and  E^,  an  effective  surface  pair  interaction  can  be  cal¬ 
culated.  When  Ec  <  £</,  the  effective  surface  interatom  pair 
interaction  is  repulsive  and  the  atoms  can  decrease  their 
enthalpy  by  maximizing  their  interaction  separation.  When 
the  effective  surface  interatom  pair  interaction  is  attractive, 
i.e.,  £<■  >  £d,  the  isolated  atoms  can  lower  their  enthalpy  by 
coalescing  and  forming  islands.  In  both  cases,  the  actual 
configuration  of  atoms  in  equilibrium  on  the  surface  will  be 
determined  by  total  free  energy.  Two  types  of  phase  tran¬ 
sitions  can  occur  depending  on  whether  £^  >  Ej  or  E^ 
<  Ej.  When  E^  >  Ej,  the  phase  transition  corresponds  to 
the  spinodal  decomposition  of  surface  atoms  and  surface 
vacancies.  When  E^  <  E^,  the  phase  transition  corresponds 
to  an  order-disorder  transition,  where  the  ordered  struc¬ 
ture  is  a  superlattice  between  the  atoms  and  vacancies  on 
the  surface.  The  particular  surface  superlattice  arrange¬ 
ment  will  depend  on  the  growth  temperature,  growth  sur¬ 
face  orientation,  and  surface  atom  concentration.  For  a 
special  case  that  includes  relaxation,  it  has  been  demon¬ 
strated  by  Chadi,'**  in  agreement  with  experiment,’*’  that  a 
superlattice  will  form  at  ;  Ga  coverage  of  a  ( U 1 )  surface. 
We  predict  small  attractive  interactions  in  this  case,  but 
relaxation  could  change  the  sign  to  agree  with  Chadi’s  re¬ 
sult. 

From  Table  I  we  conclude  that  silicon,  HgTe,  and 
most  GaAs  surfaces  will  exhibit  spinodal  decomposition 
while  most  of  the  CdTe  surfaces  and  the  singly  bonded 
gallium-terminated  (111)  surface  will  undergo  an  order- 
disorder  transition.  The  actual  nature  of  a  surface  during 
growth  will  be  determined  by  the  growth  temperature  with 
respect  to  the  critical  temperature.  We  are  currently  cal¬ 
culating  the  critical  temperature  for  the  various  order- 
disorder  transitions. 

IV.  DISCUSSION 

It  is  difficult  to  compare  the  calculated  surface  subli¬ 
mation  energies  directly  with  experiment.  Experimental 
values  of  the  sublimation  energy  of  gallium  from  the  ( 100) 
GaAs  surface'”^  vary  from  2.9  to  4.8  eV  and  for  the  cad¬ 
mium  and  tellurium  from  the  CdTe  (100)  surface,  values 
vary  between  5  and  4.7  eV  and  0.4  and  2,1  eV, 
respectively.*^*  These  values  are  normally  obtained  from 
the  slope  of  an  Arrhenius  plot  of  excess  partial  pressure  of 
one  constituent  versus  \/T.  The  apparent  discrepancy  in 
the  experimental  values  can  be  explained  even  using  the 
bulk  cohesive  energy  in  the  equilibrium  partial  pressure 
expression.  When  the  cations  vaporize  as  atoms  and  the 
anions  as  diatomic  molecules,  the  product  of  equilibrium 


'partial  pressures  of  constituent  materials  can  be  obtained 
in  terms  of  temperature  T,  atomic  (molecular)  mass  m. 
vibrational  energy  rotational  energy  and  cohesive 
energy  £/,.  It  is  given  by 


Xl 


(kTkT 


(25) 


where  subscript  a(c)  denotes  anion  (cation)  and  super¬ 
script  e  denotes  equillibrium  value.  For  known  values  of 
applied  partial  pressures,  and  P^,  the  excess  pressure 
can  be  calculated  as  function  of  l/T.  The  slope  of  Arrhen¬ 
ius  plots  calculated  using  this  analysis  falls  within  the 
range  of  experiments.  But,  because  the  slopes  thus  mea¬ 
sured  are  dependent  on  the  uncontrolled  overpressures  of 
the  constituents,  as  pointed  out  by  Heckingbottom,*’  these 
slopes  are  not  a  basic  property  of  the  system.  Therefore, 
activation  energies  deduced  from  these  experiments  depend 
strongly  on  particulars  of  the  experiments  and.  hence,  the 
SSE  calculated  in  this  paper  cannot  be  compared  to  exist¬ 
ing  experimental  values.  Yet.  the  pressure  and 
temperature-independent  activation  enthalpies  calculated 
here  are  required  input  for  MBE  growth  models. 

To  illustrate  the  use  of  the  surface  sublimation  ener¬ 
gies,  we  examine  the  MBE  growth  of  Hg).,CdjTe  for 
small  X.  and  demonstrate  the  failure  of  the  cohesive  energy 
in  explaining  one  important  experimental  result.  We  do 
this  using  a  simple  model  for  MBE  growth  which  encom¬ 
passes  only  the  necessary  features  to  demonstrate  the  use  of 
the  SSEs.  Our  calculations  predict  small  mercury  SSE  for 
all  HgTe  surfaces,  in  agreement  with  the  experimental  ob¬ 
servation  that  Hg|  _^Cd^Te  at  low  x  is  especially  difficult  to 
grow  and  requires  a  large  flux  of  mercury.  The  calculated 
values  of  the  SSEs  are  nearly  zero  for  the  dilute  surfaces, 
indicating  that  the  nucleation  of  new  mercury  layers  is 
especially  difficult.  It  has  been  experimentally  observed 
from  MBE  growth  at  185  'C  that,  for  a  given  growth  rate, 
the  (111)  surface  requires  an  order  of  magnitude  more 
mercury  than  the  (ITT)  surface.'”  We  use  our  calculated 
values  of  the  SSE  in  a  simple  growth  model  to  study  the 
corresponding  trends.  The  growth  model  to  be  described 
here  is  similar  to  that  proposed  by  Galliard."  An  improve¬ 
ment  over  the  previous  work  is  the  inclusion  of  the  change 
in  surface  entropy  due  to  the  evaporation  of  an  atom  from 
the  surface.  In  this  model,  the  growth  rate  £  is  a  difference 
between  incoming  and  outgoing  terms: 


(i-x„)yji^-x;0;i/(i-x„)=£, 

=  (26) 

where  a  is  ,4  or  £  corresponding  to  the  (111)  or  the  ( TTT ) 
surface,  is  the  fraction  of  the  surface  covered  by  mer¬ 
cury,  ^Hg  and  J^c  are  mercury  and  tellurium  fluxes,  respec¬ 
tively,  and  0Hg  and  0xe  are,  respectively,  the  mercury  and 
tellurium  evaporation  rates.  This  set  of  equations  can  be 


J  Appi  Ph'’S  .  Vo,.  68,  .'c 


4026 


8,  15  October  1990 


96 


Knshnamurthy  eta! 


4026 


FIG.  6.  Ratio  of  mercury  vapor  pressures,  required  for  same  growth  rale 
in  { 1 1 1  and  ( 1 1 1  )fl  directions,  plotted  as  a  function  of  tellunum  par¬ 
tial  pressures  and  the  growth  temperatures  in  steps  of  5  "C. 


solved  for  and  for  a  given  growth  rate  and  various 
values  of  the  tellurium  overpressure  and  growth  tempera¬ 
tures. 

The  evaporation  rate  0  is  assumed  to  be  equal  to 
where  the  escape  frequency  is  calculated  from  the 
spring  constant  of  HgTe.  The  growth  rate  is  assumed  to  be 
1  /um  per  hour.  The  orientation  and  coverage-dependent 
SSE  are  used  in  the  evaluation  of  the  ratio  of  •^Hg/ for 
various  values  of  tellurium  overpressure  and  growth  tem¬ 
perature.  The  calculated  ratio  is  plotted  in  Fig.  6  . 

It  can  be  seen  that  the  ratio  changes  by  an  order  of  mag¬ 
nitude  when  the  tellurium  pressure  is  increased  from  3  to 
10x10“*’  Torn  Also,  note  that  the  ratio  is  sensitive  to 
growth  temperature,  particularly  at  low  values  of  tellurium 
partial  pressures;  these  conclusions  fall  within  the  range  of 
the  experiment.^*’  In  the  only  reported  experiment/®  the 
tellurium  partial  pressure,  while  held  constant,  was  not 
recorded,  so  the  measured  ratio  of  9  (at  7=  185  ’C)  can¬ 
not  be  compared  directly  with  our  calculations.  However, 
as  can  be  seen  from  Fig.  6,  a  factor  of  9  for  this  ratio  occurs 
in  a  reasonable  range  of  the  tellurium  partial  pressure.  It 
would  not  be  possible  to  obtain  this  result  assuming  a  lin¬ 
ear  dependence  on  the  cohesive  energies  for  the  SSEs.  In 
addition,  our  calculations  suggest  that  mercury  incorpora¬ 
tion  can  be  increased  by  growing  at  low  temperatures  with 
higher  tellurium  partial  pressures.  Limits  will  be  set  by 
surface  migration  rates,  here  assumed  to  be  fast  compared 
to  layer  growth  rates. 

V.  CONCLUSIONS 

We  have  used  the  second-neighbor  TB  Hamiltonian 
and  GF  method  to  study  the  electronic  structure  of  the 
ideal  surfaces  and  those  with  an  isolated  atom  added  to  or 
removed  from  the  surface.  A  computationally  efficient  cal¬ 
culation  is  made  possible  by  using  a  difference  equation 
approach  and  analytical  continuation  technique.  We  have 
obtained  the  SSE  for  various  surfaces  of  HgTe.  CdTe, 


GaAs,  and  silicon.  The  calculated  values  imply  that  the 
SSEs  do  not  increase  linearly  with  the  number  of  broken 
bonds.  Also,  the  SSE  depends  on  the  surface  coverage.  In 
some  cases,  even  when  there  are  no  nearest-neighbor  sur¬ 
face  bonds,  we  find  it  easier  to  remove  an  isolated  atom 
than  to  remove  an  atom  from  a  large  island,  which  pro¬ 
vides  an  additional  reason  to  form  islands.  In  other  cases, 
however,  we  find  that  less  energy  is  required  to  remove  an 
atom  from  a  concentrated  surface  than  is  needed  when  the 
atom  is  isolated.  We  have  used  the  calculated  SSEs  in  a 
simple  model  m  study  the  ratio  of  mercury  partial  pressues 
of  (111)  to  (TIT)  surfaces  for  a  given  growth  rate.  This 
ratio  is  sensitively  dependent  on  tellurium  flux  and  growth 
temperature,  but  our  predictions  fall  within  the  range  of 
experimental  values.  We  further  predict  that  better  incor¬ 
poration  of  mercury  is  possible  at  low  temperature  with 
increased  tellunum  flux. 

Although  the  perturbation  due  to  surface  and  adatom 
is  treated  exactly  in  this  GF  method,  the  accuracy  of  the 
calculated  values  should  be  improved  by  considering  sev¬ 
eral  neglected  effects,  including  self-consistency  and  sur¬ 
face  and  adatom  relaxation  and  reconstruction.  It  is  known 
from  detailed  calculations  that  reconstruction,’’'’' 
dimerization,’^  and  relaxation  of  adatom  and  surface-layer 
lower  surface  energy.’*”  Although  there  is  no  simple  re¬ 
lationship  between  the  surface  energy  and  SSE.  the  values 
calculated  here  will  nevertheless  change  when  these 
nonidealities  are  considered.  For  example,  Northrop”  has 
shown  that  the  preferred  position  for  a  silicon  adatom  on 
the  Si  (111)  surface  (to  which  it  is,  ideally,  to  be  single 
bonded)  is  the  site  where  it  bonds  to  three  silicon  at¬ 
oms.  This  rearrangement  from  the  ideal  to  the  singly 
bonded  site  seems  to  increase  SSE  by  0.7  eV  or  so.  If  the 
surface  grows  reconstructed,  these  nonideality-limited  SSE 
are  the  appropriate  ones  to  use  in  modeling.  While  the 
removal  of  the  approximations  mentioned  earlier  will  re¬ 
fine  our  reported  SSEs.  the  underlying  mechanism  of 
charge  transfer  will  always  be  present.  The  magnitude  of 
effective  surface  interaction  energies  causes  estimated  sur¬ 
face  order-disorder  transition  temperatures  in  excess  of 
typical  MBE  growth  temperatures,  and  consequently  must 
impact  the  nature  of  the  surface  and  its  growth  consider¬ 
ably. 
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The  energy  required  to  remove  an  atom  from  semiconductor  surfaces  is  calculated  using  a  Green’s- 
function  approach.  Contrary  to  intuition,  we  find  that,  in  some  cases,  less  energy  is  needed  to  remove  an 
atom  from  the  nearly  full  surface  than  from  a  nearly  empty  surface.  The  results  are  explained  in  terms 
of  the  relative  energies  of  anion  and  cation  dangling  bonds,  and  the  charge  transfers  between  them.  The 
deducted  effective  pair-interaction  energies  and  their  effects  on  surface  morphology  and  growth  perfec¬ 
tion  are  discussed. 

PACS  numbers:  68.55.Nq,  68.3S.Md 


In  recent  years,  considerable  advances  have  been  made 
in  the  field  of  epitaxial  growth  of  semiconductors,  with 
molecular-beam  epitaxy  (MBE)  being  one  of  the  most 
important  techniques.  Modeling  of  the  MBE-growth 
process  entails  a  description  of  the  incoming  and  outgo¬ 
ing  fluxes  of  atoms  from  the  surfaces,  as  well  as  the  in¬ 
surface  migration  and  reactions  of  the  constituents.  An 
important  contribution  to  the  outflux  arises  from  the 
sublimation  of  the  constituents  from  the  growth  surface. 
Attempts  to  determine  the  activation  energy  for  surface 
sublimation  processes  have  been  made  in  a  variety  of  ex¬ 
periments  on  GaAs  (Refs.  1-3)  and  CdTe.^'^  Because 
the  values  obtained  depend  on  the  experimental  environ¬ 
mental  conditions,^  the  reported  values  differ  substan¬ 
tially.  Moreover,  the  experiments,  which  permit  eva¬ 
poration  of  many  layers,  measure  quantities  related  to 
bulk  cohesive  energies  rather  than  to  the  activation  ener¬ 
gies  that  are  more  relevant  to  growth  and  that  depend  on 
surface  orientation.  Yet  in  the  modeling  of  MBE 
growth,  it  is  essential  to  have  accurate  values  of  the  ac¬ 
tivation  energies  needed  by  the  constituent  atoms  to  es¬ 
cape  from  sites  on  a  given  surface  to  the  vapor.  ^ 

In  this  Letter,  we  present  the  energies  calculated  with 
a  Green ’s-function  (GF)  approach  for  atoms  removed 
from  isolated  positions  on  the  surface  or  from  a  nearly 
full  surface.  The  difference  in  sublimation  energies  in 
these  two  limits  is  explained  in  terms  of  the  energy 
difference  between  the  cation  and  anion  dangling  bonds, 
and  the  charge  transfers  between  those  states.  The  ener¬ 
gies  affect  surface  order-disorder  transitions  and  growth 
perfection,  as  discussed. 

We  use  a  second-neighbor,  tight-binding  Hamiltonian 
with  four  orbitals  per  atom  to  obtain  the  bulk  electronic 
structures. A  tight-binding  GF  method  is  used  to 
calculate  the  surface  sublimation  energies  (SSE).  The 
GF  is  calculated  using  a  novel  and  computationally 
efficient  difference-equation  approach  developed  recent¬ 
ly. "  The  bond  length  and  the  interatomic  and  intra- 


atomic  Hamiltonian  matrix  elements  for  the  atoms  on 
the  surface  are  assigned  their  bulk  values,  and  only  non- 
reconstructed  surfaces  are  considered. 

The  calculation  of  these  SSE  proceeds  in  two  steps. 
First,  the  GF  for  the  ideal  flat  surface  is  calculated  in 
terms  of  the  bulk  GF  using  Dyson’s  expansion.  Bulklike 
and  surfacelike  states  are  treated  on  an  equal  footing, 
and  the  density  of  states  is  calculated  as  a  continuous 
function  of  energy.  The  highest  occupied  state  Em, 
which  can  differ  from  the  valence-band  maximum,  is 
identified  by  applying  the  condition  that  no  net  field  ex¬ 
ists  in  the  ideal  stoichiometric  crystal.  Second,  the 
modified  GF  for  the  surface  with  an  atom  either  removed 
from  or  added  to  the  ideal  surface  is  calculated  in  terms 
of  the  ideal  surface  GF.  The  electronic  contribution  to 
the  total  energy  can  be  obtained  from  the  change  in  the 
density  of  states,  which  is  calculated  directly  from  the 
unperturbed  GF  and  the  perturbed  Hamiltonian.  This 
procedure  avoids  problems  associated  with  calculating 
small  energies  by  taking  the  difference  between  two  large 
energies. 

The  surface  sublimation  energy  is  defined  as  the 
difference  between  the  total  energy  of  a  surface  plus  a 
free  atom  and  that  of  the  surface  with  the  atom  bonded 
to  it.  In  this  convention,  the  atoms  with  positive  SSE  re¬ 
quire  energy  to  go  from  the  surface  to  vapor,  and  those 
whose  SSE  is  less  than  or  equal  to  zero  will  give  up  ener¬ 
gy  in  going  from  the  surface  to  vapor.  In  the  tight- 
binding  model,  the  total  energy  is  written  as  the  sum  of 
the  electronic  energy  of  the  occupied  state  and  the  repul¬ 
sive  energy  between  the  nearest-neighbor  pairs.  We 
demand  that  our  energy  parameters  produce  correct 
bond  length,  cohesive  energy  per  bond  (0.82,  1.03,  1.63, 
and  2.32  eV,  respectively,  for  Te,  CdTe,  GaAs,  and  Si), 
and  band  gap  for  the  bulk.  We  calculate  SSE  for  two 
classes  of  cations  and  anions  for  each  surface. 

When  the  perturbation  to  the  surface  involves  the  re¬ 
moval  of  an  atom  from  the  ideal  surface,  leaving  a  sur- 
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FIG.  I.  Density  of  dangling  surface  hybrid  states  of  anions 
(dashed  line)  and  cations  (solid  line)  for  ( I T I  )-oriented  CdTe. 
Em  is  the  energy  of  the  highest  occupied  level  for  a  Cd- 
terminated  ( 1 1 1  )£  surface. 

bonded  to  the  removed  cation  is  created.  The  surface 
density  of  states  for  the  isolated  anion  surface  state 
created  is  similar  to  that  shown  in  Fig.  1  for  the  pure 
anion-terminated  surface,  with  only  minor  differences  in 
the  widths  and  heights  in  the  peaks.  Because  this  anion 
surface  state  lies  near  the  top  of  the  valence  band,  the 
electrons  from  the  cation  surface  states  will  transfer  into 
this  level. 

As  this  process  reduces  the  energy  of  the  final 
configuration,  the  SSE  for  removal  of  a  cation  in  this 
concentrated  limit  is  also  reduced.  However,  ^r  an  iso¬ 
lated  cation  removed  from  the  underlying  (ill)  anion 
surface,  E„  is  at  the  top  of  the  valence  band,  which  is 
very  close  to  the  newly  exposed  anion  surface  states.  The 
reduction  in  the  SSE  from  charge  transfer  is  smaller 
than  that  in  the  concentrated  case  just  considered.  Ap¬ 
parently,  the  difference  in  the  charge-transfer  energies 
between  Ej  and  Ec  outweighs  the  band-broadening  ener¬ 
gy  gained  in  Ec  and  results  in  Ej  >  Ec.  The  charge- 
transfer  processes  are  reversed  for  surface  sublimation  of 
anion  from  the  (ill)  surface.  Since  three  bonds  are 
broken,  more  electrons  are  involved  in  the  reversed 
charge  transfer,  and  the  difference  between  Ej  and  Ec  is 
also  larger. 

We  conclude  that  the  charge  transfer  between  the  cat¬ 
ion  and  anion  dangling  states  will  always  increase  Ej 
and  decrease  Ec-  The  magnitude  of  this  effect  depends 
on  the  amount  of  charge  transferred  and  the  separation 
between  the  relevant  energy  levels.  Whenever  the  effect 
is  substantial,  £<■  becomes  smaller  than  Ej  and  an 
effective  repulsive  surface-pair  interaction  results. 


Because  silicon  contains  two  identical  atoms  in  the 
unit  cell,  no  charge  transfer  is  expected  and  £«.  is  always 
larger  than  Ej.  The  surface-pair  interactions  are  always 
attractive.  Although  charge  transfer  occurs  in  GaAs,  the 
energy  difference  between  the  maximum  occupied  level 
and  the  newly  exposed  dangling-bond  state  is  very  small. 
The  value  of  Ec  —  £</  is  reduced  to  make  the  surface-pair 
interactions  weak  and  still  attractive  in  most  cases,  but 
the  interaction  is  repulsive  for  the  Ga-terminated  (ill) 
surface.  Because  of  an  increased  energy  difference  in  the 
hybrid  states  in  II-VI  compounds,  the  energy  separation 
between  the  dangling-bond  states  is  large.  In  CdTe,  for 
example,  the  effect  of  the  charge  transfer  is  substantial 
and  causes  Ec  to  be  less  than  Ej  for  every  case  in  Table  I 
except  for  anions  on  the  (111)^4  surface.  Although 
HgTe  is  a  II-VI  compound,  charge  transfer  does  not 
occur  because  HgTe  is  a  semimetal  with  no  forbidden 
gap.  In  HgTe,  the  dangling  states  are  resonant  in  the 
band  and  the  conduction -band  edge  is  always  the  max¬ 
imum  occupied  level.  The  cation  dangling  states  are 
never  occupied  and,  consequently,  charge  transfer  does 
not  occur  and  Ec  is  always  larger  than  Ej.  The  reduc¬ 
tion  in  the  difference  of  Ec  and  Ea  due  to  the  charge 
transfer  is  expected  to  be  large  for  all  wide-gap  II-VI 
compounds. 

The  relative  magnitudes  of  the  surface  energies  have 
profound  consequences  on  the  growth  habits  of  these 
crystals.  When  £<■  is  larger  than  Ea,  the  effective  in¬ 
teraction  between  the  surface  atoms  is  attractive  and  go¬ 
ing  from  below  the  critical  temperature  to  above  it,  the 
surface  will  undergo  a  phase  transition  from  smooth  to 
rough.  The  smooth  surface  will  have  islands  while  in  the 
rough  limit,  the  atoms  will  occupy  the  sites  randomly. 
Well  below  the  critical  temperature,  the  islands  contain 
few  vacancies. 

When  Ec  is  smaller  than  £rf,  however,  the  effective 
surface-pair  interaction  is  repulsive.  At  growth  tempera¬ 
tures  below  the  order-disorder  transition  temperature, 
the  atoms  on  the  surface  will  arrange  themselves  to  max¬ 
imize  the  number  of  atom-vacancy  bonds,  resulting  in 
superlattice  arrangements.  The  preferred  order  superlat- 
tice  arrangements  depend  on  surface  composition.  At 
temperatures  above  the  transition  temperature,  the  ar¬ 
rangement  of  atoms  will  be  nearly  random.  We  con¬ 
clude  from  Table  I  that  Si,  HgTe,  and  most  GaAs  sur¬ 
faces  grow  in  the  smooth-to-rough  transition  region, 
while  most  CdTe  and  singly  bonded  Ga-terminated 
(ill)  surfaces  grow  in  the  superlattice-ordered-to- 
random  transition  region.  The  surface  phase  is  deter¬ 
mined  by  the  growth  temperature.  Hence,  the  surface 
morphology  is  critically  dependent  on  the  nature  and  the 
magnitude  of  the  interactions  between  the  surface  atoms. 

In  this  Letter,  we  considered  only  ideal  surfaces  and  il¬ 
lustrated  that  the  charge-transfer  mechanism  will  always 
reduce  the  effective  atom-atom  interaction  on  the  sur¬ 
face.  However,  it  is  known  from  detailed  calculations 
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that  reconstruction,'’*”  dimerization,'^  and  relaxation 
of  adatom  and  the  surface  layer  lower  the  surface  energy 
substantially.”  '*  Although  the  effective  interaction  de¬ 
pends  on  the  difference  of  EcEj,  effect  due  to  these 
mechanisms  will  have  to  be  included  for  quantitatively 
more  accurate  results.  While  the  removal  of  the  approx¬ 
imations  will  refine  our  reported  values,  the  underlying 
mechanism  of  charge  transfer  will  always  be  present. 
For  one  special  case,  Chaid,'*  in  agreement  with  an  ex¬ 
periment,'’  has  demonstrated  that  a  superlattice  will 
form  at  the  relaxed  i  Ga-filled  (111)  surface  of  GaAs. 
His  predicted  long-range  order  is  mainly  driven  by  the 
same  kind  of  charge  shifts  as  those  we  find  in  our  GF 
method.  The  magnitude  of  effective  surface  energies 
causes  the  estimated  order-disorder  transition  tempera¬ 
tures  to  be  in  excess  of  typical  MBE-growth  tempera¬ 
tures  and,  consequently,  must  impact  on  the  nature  of 
the  surface  and  its  growth  considerably. 
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ABSTRACT 

A  single-and  multilayer  growth  model  is  presented.  Surface  order-disorder  transitions  are 
studied  with  the  entropy  calculated  in  the  Bragg-William  approximation  and  in  the  quasi¬ 
chemical  approximation.  A  plausible  explanation  for  high-quality  growth  obtained  with  energy- 
assistance  is  given.  The  model  has  been  extended  to  study  low- temperature  epitaxial  growth  of 
HgTe  and  CdTe  on  different  surfaces.  The  relevant  surface  energies  are  evaluated  in  a  Green’s 
function  approach. 

INTRODUCTION 

The  theory  of  surface  order  of  solids  is  essential  to  an  understanding  of  their  growth 
properties.  The  mechanisms  dominating  surface  order  depend  on  a  number  of  factors,  including 
the  crystal  orientation,  substrate  temperature,  and  growth  method.  The  atoms  arriving  at  a 
growing  surface  interact  both  with  the  underlying  surface  and  with  one  another.  Their  mutual 
interactions  may  cause  them  to  undergo  an  order-disorder  transition  from  a  “rough”  to  a 
"srrwoth”  (or  ordered  surface)  or  from  a  “rough”  to  a  “superlattice”  (SL)  surface.  In  the  smooth 
limit,  the  newly  arriving  atoms  tend  to  cluster  into  islands;  in  the  disordered  state,  the  atoms  tend 
to  be  randomly  distributed  [1],  and  in  the  SL-ordered  state,  the  atoms  and  vacancies  on  the 
surface  form  long-ranged  ordered  patterns.  For  a  half-filled  cubic  surface,  for  example,  one  SL 
pattern  is  a  checkerboard  arrangement  of  atoms  on  the  available  sites.  The  temperanire  that 
characterizes  the  transition  from  smooth  to  rough  surface  is  called  surface  roughness  transition 
temperature,  Tg.  When  the  growth  temperature  T  is  smaller  than  T^.  the  surface  will  be  smooth. 
The  growth  rates  and  crystal  quality  are  expected  to  differ  in  these  two  limits. 

The  rough-to-smooth  transition  occurs  when  the  effective  interaction  between  surface  atoms 
is  attractive,  while  the  rough-to-SL  transition  is  a  consequence  of  repulsive  surface  atom-atom 
interactions.  Because  repulsive  interactions  among  surface  atoms  have  not  been  previously 
considered  to  be  realistic,  this  type  of  order-disorder  transition  on  the  surface  has  not  b«n  treated 
extensively  in  the  literature. 

In  this  paper,  we  first  study  the  surface  roughness  for  a  cubic  lattice  by  obtaining  the  surface 
entropy  contribution  to  the  change  in  free  energy  in  a  random  approximation  [also  c^ed  Bragg- 
Williams  approximation  (BWA)],  and  in  the  quasichcmical  approximation  (QCA)  with  pair 
interactions.  Based  on  the  difference  between  their  predictions,  we  suggest  a  plausible 
explanation  for  better  quality  growth  obtained  with  energy  assistance.  The  surface  roughness 
model  is  then  extended  to  study  the  surfaces  of  realistic  semiconductors — HgTe  and  CdTe — 
grown  by  atomic  layer  epitaxy  (ALE)  and  by  molecular  beam  epitaxy  (MBE).  The  transition 
temperatures  and  surface  roughness  for  growth  in  various  orientations  are  given. 

SURFACE  ROUGHNESS  THEORY 

Because  of  space  limitations,  the  approach  will  be  discussed  in  brief;  the  details  can  be 
found  in  the  literature  for  the  single-layer  growth  model  with  BWA  [1]  and  for  multilayer  growth 
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model  with  QCA  [21.  We  begin  by  considering  a  monatomic  cubic  solid  grown  from  a  (100)- 
oriented  seed  with  a  smooth  crystal  surface.  The  change  in  enthalpy  and  entropy  are  calculated 
to  obtain  the  change  in  f  ee  energy.  The  most  probable  configuration  is  obtained  by  minimizing 
the  free  energy.  The  atoms  arriving  at  the  surface  make  one  bond  each  with  the  layer  below  and 
above,  and  four  in-plane  surface  bonds. 

In  the  multilayer  growth  model  with  QCA,  we  first  identify  all  distinct  classes  of  pairs  that 
contribute  to  the  free  energy.  Secondly,  we  write  the  probability  for  occurence  of  each  type  of 
pair.  Then,  using  appropriate  factors  for  indistinguishability  of  pairs,  the  entropy  for  different 
layers  is  written  with  appropriate  constraints.  The  free  energy  is  then  minimized  to  obtain  the 
most  probable  pair  distribudon.  In  this  muldlayer  model,  an  atomic  site  in  any  upper  layer  is 
allowed  to  be  occupied,  even  if  the  site  immediately  below  it  is  not  occupied.  In  other  words, 
“over-hangs"  are  permined.  Although  these  configuradons  are  energedcally  less  favorable,  they 
contribute  to  the  entropy,  and  thus  should  be  included  in  the  free  energy.  In  this  model,  we  have 
four  types  of  pairs:  atom-atom,  atom-vacancy,  vacancy-atom,  and  vacancy-vacancy.  For  each 
layer,  there  are  nine  variables  (four  interlayer  pairs,  four  inoalayer  pairs,  and  one  layer 
concentradon)  and  six  constraints  reladng  them.  The  minimizadon  of  free  energy  in  two  of  the 
remaining  three  variables  can  be  carried  out  analydcally.  The  numerical  minimizadon  in  the 
remaining  one  variable  is  then  carried  out.  The  temperature  at  which  the  second  derivadve  of  the 
free  energy  vanishes  for  the  surface  coverage  of  0.5  gives  the  transidon  temperature,  T^. 

The  calculated  layer  concentradons  in  the  three-layer  growth  model  are  shown  in  Figure  1 
as  a  funcrion  of  surface  coverage  averaged  over  three  layers  for  three  growth  temperatures  Tj.  Tj 
and  Ti  (Tj  <  Tc  <  T2)  by  thick-dashed,  dashed,  and  solid  lines,  respccdvely.  The  index  on  the 
curves  represent  the  layer  number.  At  x  =  1,  3N  atoms  have  been  added  to  the  surface,  where  N 
is  the  number  of  sites  per  layer.  In  this  '■onvention.  Layer  1  is  immediately  above  the  seed 
surface.  We  note  that  at  growth  temperatures  near  and  above  T^.  all  three  layers  grow 
simultaneously,  but  with  xj  >  X2  >  X3.  However,  at  temperatures  below  Tj,  the  uppe*  layers  do 
not  grow  undl  the  lower  layers  are  nearly  full.  In  this  limit,  layer-by-layer  growth  takes  place. 

At  temperatures  well  below  T^  the  atoms  arriving  at  the  surface  cannot  move  and  imperfect 
growth  results.  The  quality  of  the  growth  can  be  characterized  by  the  ftacdon,  yp,  of  atom-atom 
pairs  to  the  toul  number  of  pairs  in  a  layer,  yp  has  been  calculated  in  the  BWA  and  QCA. 
Figure  2  shows  that  less  perfect  growth  occurs  in  BWA  case.  However,  if  there  is  energy 
assistance,  where  photons  or  ions  provide  enough  energy  to  permit  the  atoms  to  move  on  the  sur¬ 
face,  then  the  QCA  entropy  is  more  appropriate  and  the  perfecdon  improves  (yp  increases).  The 
reason  for  the  difference  between  yp  in  these  cases  is  evident:  The  BWA  entropy  is  larger,  and 
therefore  it  emphasizes  the  drive  toward  complexity.  This  results  in  layers  at  a  given  coverage 
with  more  imperfecdons.  QCA,  where  the  entropy  is  smaller,  predicts  that  layers  are  more 
nearly  perfect. 

When  the  substrate  temperature  is  sufficiently  low.  the  characterisdc  dme  constant  for  sur¬ 
face  migration  is  larger  than  the  the  equilibradon  dme  for  interacdon  of  the  surface  with  beam 
and  gas  (thought  of  as  a  heat  bath).  Under  these  condidons,  BWA  is  more  appropriate.  How¬ 
ever,  if  the  surface  mobility  is  enhanced  with  an  energy  assist,  the  effective  time  constant  for  the 
migration  is  reduced  (so  that  surface  equilibradon  can  take  place  faster  than  equilibradon  with 
beamor  gas  bath),  then  the  QCA  is  more  appropriate.  This  is  a  plausible  explanation  for  high 
quality  obtained  in  energy-assisted  epitaxial  experiments  [3,4].  The  energy  assist  in  these  experi¬ 
ments  is  sufficient  to  allow  the  needed  surface  mobility. 
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Fig.  1 .  Layer  coverage  as  a  function  Fig  2.  Vanation  of  y^  «wth  average  coverage 

of  average  coverage 

ENERGETICS  AND  GROWTH  MODEL  R  SEMICONDUCTORS 

The  model  described  in  the  previous  section  has  been  generalized  to  the  case  of  diamond 
and  zinc  blende  lattices.  Also,  the  energies  of  interaction  between  pairs  on  the  surface  and  that 
between  layers  has  been  evaluated,  using  a  Green's  function  approach.  We  calculate  the  energy 
to  remove  the  atoms  from  nearly  empty  surfaces,  referred  as  the  dilute  case,  and  that  to  remove 
the  atoms  from  nearly  full  surfaces,  referred  as  concentrated  case  (5).  The  respective  energies 
are  denoted  Ej  and  E^.  The  calculations  were  carried  out  for  the  removal  of  anions  and  cations 
from  various  orientations  of  Si.  GaAs,  CdTe,  jtd  HgTe.  The  details  of  these  calculations  will  be 
published  elsewhere  (5).  We  report  only  HgTe  and  CdTe  results  here. 

In  the  single-layer  growth  model,  we  consider  three  kinds  of  pairs:  namely,  atom-atom, 
atom-vacancy,  and  vacancy-vacancy  and  their  respective  energies  E„,  E,y,  and  £»».  The 
number  of  effective  bonds  an  atom  makes  to  the  substrate,  iiq.  and  the  number  of  surface  bonds. 
Til,  '’F  specified.  The  free  energy  and  the  surface-pair  populations  are  obtained  from 

the  effective  energy  Egff.  given  by  (E,v  -  (Eja+EyvVZ).  Without  calculating  E„,  E,v  or  we 
can  find  E^^  from  the  difference  of  calculated  E^  and  E^,  and  tij  from  Table  I.  In  our 
convention,  a  negative  E^fr  indicates  an  attractive  interaction  between  surface  atoms. 

In  atomic  layer  epitaxy,  grown  from  a  free  atom  vapo'  ohase,  only  one  type  of  atom  species 
(anion  or  cation)  is  present  at  a  given  time,  and  the  growth  is  necessarily  layer  by  layer.  The 
single-layer  model  developed  in  the  previous  section  can  be  applied  to  study  HgTe  and  CdTe 
with  the  calculated  pair  energies.  QCA  is  is  used  in  the  evaluation  of  surface  entropies.  When 
the  pair  interaction  is  attractive,  the  smooth-to-rough  transition  takes  place,  as  the  growth 
temperature  is  varied  near  Tj..  However,  when  the  pair  interaction  is  repulsive,  the  surface  is 
either  rough  or  the  atoms  are  in  a  SL  arrangement  with  vacancies.  Then  the  order-disorder 
transition  takes  place  near  a  different  tj.  The  preferred  ordered  SL  arrangemerts  are 
composition-dependent;  however,  in  regions  of  ordering,  the  surface  will  be  arranged  to 
maximize  the  number  of  atom-vacanc  ,  lairs.  Calculated  values  of  T^  are  given  for  growth  of 
HgTe  and  CdTe  in  (111)  and  (100)  directions  in  Table  I. 
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Table  I.  Extraction  energies  and  cnncal  temperaiures- 


CdTe 

HgTe 

Atom/ 

Ed 

Ec 

Tc 

Ed 

Ee 

Tc 

Surface 

Layer 

no 

ni 

(eV) 

leV] 

(K) 

(eV) 

leV] 

[K] 

(lll)A 

a 

1 

6 

3.9 

4.2 

710 

1.8 

3.8 

4700 

c 

3 

6 

2.7 

0.6 

5000» 

-0.3 

1.1 

3300 

(lll)B 

a 

3 

6 

5.2 

2.1 

7400* 

2.8 

2.8 

0 

c 

1 

6 

2.2 

1.3 

2100* 

-0.2 

0.3 

1200 

(100) 

a 

2 

4 

5.0 

2.6 

5000» 

2.4 

3.1 

1500 

c 

2 

4 

2.1 

0.1 

4200* 

-0.2 

1.0 

2500 

In  our  calculation,  the  pair  interaction  is  always  attractive  in  HgTe  and  the  temperatures 
sh'-wn  in  Table  I  indicate  T,.  for  the  rough- to- smooth  transition.  However,  the  pair  interaction  is 
r. alsive  for  QlTe,  except  for  the  grow  of  a  singly  bonded  anion  layer  on  the  (1 1 1)B  surface. 
Temperatures  1^  denote  the  rough-SL  transition  temperatures.  Because  the  T^  values  are  very 
large.  ALE  growth  of  HgTe  occurs  mostly  in  the  smooth  domain  limit  and  that  of  CdTe  occurs 
mostly  in  the  SL  domain  limit. 

In  layers  grown  by  MBE,  both  anion  and  cauon  atoms  impinge  on  the  surface  at  the  same 
time  and  both  layers  can  grow  simultaneously.  The  way  in  which  the  layer  grows  depends  on  the 
flux  rates  of  the  two  constituents  and  their  respective  sticking  coefficients.  Hence,  an 
appropriately  generalized  multilayer  model  to  that  described  briefly  in  the  previous  section  is 
applicable.  For  simplicity,  we  assume  only  double-layer  growth.  The  interlayer  pair  interaction 
energies  are  obtained  from  Ej  and  tiq.  The  intralayer  paii  energies  for  each  layer  are  obtained 
from  Ej,  Ej,  and  rii.  With  the  surface  entropy  obtained  in  QCA,  the  free  energy  is  calculated  for 
various  values  of  anion  and  cation  layer  coverage.  The  effect  of  anti-site  defects  is  not  included. 

Because  of  the  repulsive  intralayer  pair  energies,  CdTe  surface  is  expected  to  grow  in  SL 
domains.  Hence,  in  addition  to  random  distributions  considered  herev  one  must  consider  all 
ordered  configurations  for  a  given  concentration.  This  case  is  being  studied  in  detail  and  will  not 
be  presented  here.  However,  the  interlayer  and  intralayer  pair  interactions  in  HgTe  are  attractive, 
so  the  double -layer  growth  model  applies  to  this  system. 

When  the  minima  in  the  free  energy  at  a  given  temperature  occur  near  0  and  1.  islands  are 
formed  on  the  surface  that  ate  nearly  fully  occupied  and  the  remainder  of  the  area  is  nearly 
empty.  As  the  surface  coverage  is  increasea,  the  islands  gro  --  in  ^ize  with  a  fixed  vacancy  cc.i- 
centration  until  the  layer  is  fully  grown.  The  vacancies  fill  only  at  the  end  of  the  layer  growth; 
then  the  formation  of  the  next  layer  takes  place.  These  events  are  calculated  by  examining  a 
sequence  of  equilibrium  arrangements.  Realistically,  kinetic  effects  will  prevent  all  vacancies 
from  filling  in  one  layer  before  the  start  of  the  next  layer.  Thus,  we  expect  that  island  growth — 
where  the  vacancy  concentrations  are  low — is  likely  to  correlate  with  high-quality  material. 
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Sequential  double-layer  growth  is  predicted  to  be  the  case  for  HgTe,  for  example,  in  (lll)A 
direction.  In  this  case,  at  185'C  we  predict  smooth  growth  in  ALE  for  both  Hg  and  Te  layers, 
and  in  either  Hg  or  Te  stabilized  MBE  growth  [6]  we  also  predict  smooth  lay , '  growth.  At  this 
same  temperature  on  the  (1 1 1)B  surface  in  ALE  we  find  smooth  growth  for  the  layer,  but 
rough  growth  for  the  Te.  Moreover,  in  this  particular  case,  we  find  Te-stabilized  MBE  growth  to 
be  smooth,  and  Hg-stabilized  MBE  growth  to  be  rough.  In  principle,  it  is  possible  to  find  sinia- 
tions  and  temperatures  where  in  ALE  one  layer  grows  smooth  and  the  other  rough,  and  in  MBE 
the  layers  grow  rough  for  both  stabilizations.  This  provides  a  rationale  for  the  sensitivity  of 
material  quality  to  stabilization  type,  and  temperature. 

CONCLUSION 

We  have  calculated  the  excess  fiee  energy  of  the  surfaces  for  addition  of  several  layers  of 
atoms.  The  surface  energies  that  enter  the  statistical  mechanics  are  found  in  a  Green's  function 
approach.  The  entropy  calculation  employs  a  surface  modification  of  QCA.  While  the  calcula¬ 
tion  only  finds  the  equilibrium  arrangement  of  atoms  for  specified  surface  coverages,  given  rea¬ 
sonable  speculadons  about  kinetic  effects,  we  can  appreciate  how  energy  assist  functions,  in 
principle,  to  ir.iprove  the  quality  of  epitaxial  layers;  the  phenomenology  dominating  HgTe 
growth  for  different  orientations;  and  reasons  to  expect  differences  between  the  MBE  growth 
habits  of  QlTe  and  HgTe. 
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The  current  status  of  ab  initio  methods  to  study  the  properties  of  solids  is  reviewed.  During  the 
past  ten  years,  a  wealth  of  calculations  have  established  that  many  properties  of  solids  can  be 
accurately  calculated  from  fust  principles.  Recent  advances  in  methods  now  make  it  possible  to 
treat  a  number  of  practical  problems  in  materials  science  within  tolerable  times  and  costs. 


Practically  every  solid-state  physics  text  begins  by  pointing 
out  the  difficulty  of  solving  the  Schrddinger  equation  for 
—  10’’  particles,  and  that  numerous  appro.ximations  are  re¬ 
quired  to  make  progress.  Twenty-five  years  ago,  Hohenberg 
and  Kohn'  established  the  framework  for  modern  first-prin¬ 
ciples  calculations  by  proving  that  the  total  energy  of  a  sys¬ 
tem  was  a  functional  only  of  the  electron  density.  This  func¬ 
tional,  however,  is  unknown.  A  subsequent  “local” 
approximation’  made  it  possible  to  cast  the  problem  as  a  set 
of  independent  particles  moving  in  an  effective  “one-elec¬ 
tron"  potential  for  which  explicit  expressions  could  be  ob¬ 
tained.  This  approach,  now  known  as  density  functional  the¬ 
ory  (DFT)  and  the  local  density  approximation  (LDA), 
has  evolved  into  a  parameter-free  theory  capable  of  predict¬ 
ing  structural  and  electronic  properties  with  reasonably  high 
precision.'  However,  until  recently,  the  computational 
methods  required  to  actually  solve  these  equations  were  so 
slow  that  even  on  large  machines  calculations  have  been  re¬ 
stricted  to  simple  systems,  e.g..  silicon  and  GaAs."*  There  has 
been  a  significant  advance  in  computational  methods  in  the 
last  few  years,  making  it  possible  now  to  solve  problems 
much  more  efficiently  than  previously.''"  This  constitutes 
an  impressive  advance  in  science,  but,  perhaps  more  impor¬ 
tant,  a  revolution  in  technology.  Now  it  is  practical  in  sys¬ 
tems  as  complicated  as  HgTe,  CdTe,  and  soon  the  alloys 
Hg|  ,Cd,Te,  to  compute,  for  example,  native  defect  ener¬ 
gies.  "  impurity  substitution  energies.'  impurity  intersti¬ 
tial  energies."  and  diffusion  coefficients."  as  well  as  the 
simpler  bond  lengths,  cohesive  energies,  and  elastic  con¬ 
stants."’  Accurate  calculations  of  defect  electrical  activity 
and  transport  properties  based  on  these  parameter-free  theo¬ 
ries  are  within  reach.  This  capability  should  have  a  dramatic 
impact  on  device  processing  and  performance  design. 

The  purpose  of  this  paper  is  to  review  the  status  of  this 
advance  and  to  provide  the  reader  with  some  insight  into  its 
prospects.  The  nature  of  the  analysis  is  illustrated  schemati¬ 
cally  in  Fig.  1 .  It  begins  with  a  trial  density  n,„ .  often  a  super¬ 
position  of  free-atomic  densities.  Using  the  LDA  (see  Fig. 
1 ).  this  density  generates  an  effective  one-electron  potential 
and  a  Hamiltonian,  which  in  turn,  generates  an  electron 
density  is  mixed  with  n,,,  and  the  cycle  is  repeated 

until  self-consistency  is  reached,  i.e.,  =  n,„.  Coulomb 

and  magnetic  interactions  as  well  as  relativistic  terms  are 
included.  The  Schrddinger  equation  must  be  solved  in  terms 
of  a  basis  set.  Two  leading,  generally  applicable  basis  sets  are 
the  linearized  augmented  plane  wave  ( LAPW ) ' '  and  linear 
muffin-tin  orbital  ( LMTO ) '  methods.  For  any  basis  set,  the 


computational  time  for  a  solution  using  conventional  tech¬ 
niques  is  proportional  to  .V  \  where  .V  is  the  number  of  basis 
states  per  unit  cell.  The  output  is  the  wave  functions  '!'(/•), 
and  the  corresponding  electron  density  (r). 

The  exact  functional  that  generates  V,„  from  n,„  is  not 
known,  and  in  practice  it  is  possible  to  do  this  only  within  the 
LDA,  for  which  explicit  expressions  can  be  calculated.  The 
LDA  has  been  shown  to  produce  good  valence  band  states' 
in  semiconductors  and  good  structural  properties  in  most 
materials,  e.g.,  atomic  volume,  cohesive  energies,  and  elastic 
constants."’  The  band  gap  and  conduction  band  states  are 
less  accurately  predicted  in  LDA.  However,  it  has  recently 
been  show  n  that  these  properties  can  be  accurately  calculat¬ 
ed  using  a  technique  called  the  “GW“  approximation.’’ 
GW  IS  the  first  term  in  a  perturbation  expansion  of  the  exact 
many-body  Schrodinger  equation.  Very  good  optical  prop¬ 
erties  have  been  obtained  in  semiconductors  when  the  LDA 
is  used  as  a  starting  point  to  calculate  GW,  the  Green's  func¬ 
tion  G  and  screen  Coulomb  interaction  W. 

Among  the  basis  sets  used  to  solve  the  Schrodinger  equa¬ 
tion.  the  pseudopotential  method  is  the  simplest  because  it 
uses  plane  waves.  Plane  waves  (PW)  are  suitable  only  for 
smooth  potentials  as  they  are  eigenfunctions  of  free  elec¬ 
trons.  but  they  are  unsuitable  for  systems  with  d  electrons, 
such  as  CdTe  and  HgTe  Even  in  a  simple  semiconductor 
such  as  Si.  100  to  1000  PW  are  needed  per  atom,  depending 
on  the  precision  sought.  Because  the  computation  time  var¬ 
ies  as  .V the  direct  application  of  this  method  is  limited  to 
problems  with  relatively  few  atoms  per  unit  cell.  Car  and 
Parrinello'  devised  an  improvement  to  this  method  in  which 
the  Hamiltonian  itself  is  allowed  to  evolve  while  iterating  for 
the  eigenstates.  The  computation  time  for  one  iteration  in 
their  method  is  proportional  to  NM where  A/ is  the  number 
of  occupied  states.  This  advance  permits  much  larger  sys¬ 
tems  to  be  treated,  since  M4N in  that  method. 

A  second  important  method  is  LMTO.  It  employs  a  much 
more  efficient  basis  set  with  only  ~20  orbitals/atom  re¬ 
quired  to  reach  convergence.  The  LMTO  method,  until  re- 


Schrodinger  Poisson  Eqn. 

H(V,n) — §21 -  'Mrj.nouilr)  — •2EI - Voui?) 

Until  Vooi  =  V,„ 

Fk,.  I  Scll-^.■onsl^l^.•^cv  li«)p 
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cently,  employed  the  atomic  spheres  approximation  (ASA) 
in  which  within  each  iteration  of  the  self-consistency  loop 
the  potential  is  approximated  by  its  spherical  average  about 
the  ions  and  some  interstitial  positions  (empty  spheres).^ 
This  approximation  limited  the  applicability  of  the  method 
to  problems  of  high  symmetry;  it  excluded  problems  like 
substitutional  impurity  atoms,  or  surfaces  from  those  for 
which  the  precision  was  limited  by  LDA.  Two  years  ago,  a 
full-potential  LMTO  (FP-LMTO)  was  devised*’  that  re¬ 
moved  the  ASA,  thereby  opening  a  whole  new  range  of  ap¬ 
plications.  The  Car  and  Parrineilo  technique  still  works  in 
FP-LMTO  but  does  not  yield  much  speed  enhancement  be¬ 
cause  the  basis  set  is  already  small.  There  is  another  advance 
of  value,  however:  the  Harris-Foulks  functional.'^  ‘‘‘  This  is 
a  technique  for  selecting  a  density  functional  that  is  insensi¬ 
tive  to  details  of  the  electron  density  and  obviates  the  need 
for  the  self-consistency  loop.  The  technique  increases  the 
speed  of  calculations  with  precision  still  limited  by  LDA. 

A  concrete  example  of  the  speed  of  current  calculations  is 
presented  in  Table  1.  The  example  selected  is  a  calculation  of 
the  tellurium  antisite  defect  in  CdTe  using  a  supercell  of  64 
atoms.  The  quantities  calculated  are  the  substitution  energy, 
the  impurity  levels,  and  strain  distributions  around  the  re¬ 
laxed  antisite. 

In  this  approach,  the  crystal  is  treated  as  a  periodic  ar¬ 
rangement  ofsupercells  with  one  antisite  in  each  cell.  When 
the  cell  size  is  large  enough  so  the  antisites  no  longer  interact 
with  one  another,  then  the  calculated  properties  converge  to 
those  of  an  isolated  antisite.  We  find  that  a  cell  size  of  32 
atoms  is  often  sufficient  but  the  numbers  in  Table  I  are  for  64 
atom  cells. 

The  estimated  computation  times  required  to  solve  this 
problem  on  an  Apollo  DN10.(X)0  computer,  using  a  single 
CPU  are  listed  in  Table  I.  This  machine  is  small  enough  to  sit 
in  a  closet,  costs  <5100,000,  and  is  run  by  our  professional 
staff.  Its  clock  time  is  roughly  1/6  that  ofaCra\ ,  but  because 
the  Cray  is  a  vector  processor,  it  runs  this  kind  of  problem 
about  30  times  faster  than  the  Apollo.  Quoted  times  and 
precisions  are  approximate,  and  four  iterations  to  self-con¬ 
sistency  are  assumed. 

As  you  can  see  from  the  table,  conventional  pseudopoten¬ 
tial  methods  are  out  of  the  question  for  a  problem  with  64 
atoms  per  unit  cell.  However,  the  Car-Parrinello  (for 

T  xm.r  I  Computational  effort  to  calculate  an  antisite  defect  in  CdTe  using 
a  supercell  approach  (64  aloms/cell).  Time  is  for  an  Apollo  DN  10.000. 
single  CPU  Times  and  precisions  are  approximate  Four  iterations  to  self- 
consistcncy  are  assumed. 


Method 

Orbitals/atom 

Time  (h) 

Precision  (eV) 

Pseudopoiential ' 

90 

10  (XX) 

0  2 

3(X) 

<  10' 

LDA" 

LAPW 

50 

5000 

0.2 

100 

40  000 

LDA" 

Car-Parrinello' 

300 

1  OOO 

LDA" 

LMTO-ASA 

13 

100 

0.5 

LMTO-FP 

22 

500 

LDA" 

'  Pseudop<4tential  form  unsuitable  for  i/-band  materials 
LDA  error  is  unknown  but  is  expected  to  be  about  0  1  eV. 
‘  Unsuitable  for  calculating  lattice  relaxations 


smaller  problems)  and  LMTO  methods  are  quite  practical. 
A  time  of  100  h,  or  about  four  days,  for  an  ASA-LMTO 
solution,  on  a  machine  that  sits  in  a  closet  operating  in  the 
background,  is  perfectly  acceptable.  Our  machine  actually 
has  three  CPUs  and  can  accommodate  four,  so  if  one  person 
IS  doing  a  production  run  it  does  not  seriously  affect  the  rest 
of  the  group.  If  the  Harris-Foulks  functional  is  used,  a  factor 
of  four  is  gained.  A  time  of  500  h  for  FP-LMTO  requires 
some  patience,  but  even  this  is  acceptable.  Using  an  optimal¬ 
ly  vectorized  and  parallelized  code  can  greatly  reduce  this 
time:'’  indeed,  we  are  now  testing  an  experimental  version 
of  the  ASA  program  that  does  this  calculation  in  about  10  h. 
using  the  three  CPUs  in  parallel.  We  wish  to  emphasize  that 
problems  involving  64  atoms  per  unit  cell  are  well  outside 
the  capabilities  of  earlier  methods  and  remain  time  consum¬ 
ing  even  using  FP-LMTO.  However,  a  problem  dealing  with 
host  material  properties  with  only  two  atoms  per  cell,  e.g.. 
the  band  structure  or  cohesive  energy  r,„  (64/2) '  =  3  X  10^ 
times  faster,  so  instead  of  500  h  the  results  are  obtained  in 
about  one  minute. 

In  the  past,  solid-state  theory  was  able  only  to  calculate 
general  trends  of  phenomena  from  first  principles,  or  with 
more  accuracy  predict  interrelations  between  various  obser¬ 
vables,  i.e.,  parametrize  potentials.'  Parametrized  potentials 
generally  can  be  trusted  only  in  circumstances  where  the 
local  atom  arrangement  deviates  little  from  the  one  for 
which  the  parameters  were  chosen.  Hence,  for  example,  if 
parameters  are  chosen  to  fit  bulk  properties,  quantitative 
predictions  of  surface  reconstruction  may  not  be  trustwor¬ 
thy.  although  the  symmetry  may  be  properly  given.  The  new 
ab  initio  methods  reviewed  here  do  not  suffer  from  this  kind 
of  uncertainty  and.  as  a  consequence,  can  be  trusted  to  with¬ 
in  a  given  precision.  This  ability  should  be  an  invaluable  help 
in  interpreting  observations,  devising  means  to  circumvent 
deleterious  effects,  and  designing  manufacturing  processes. 
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Abstract.  We  review  the  current  status  of  knowledge  of  fundamental  properties  of 
the  alloy  Hg,  _^Cd.Te.  The  most  vexing  questions  are  about  its  correlation  state. 
Several  different  experiments  now  suggest  it  is  highly  correlated,  but  no  theory 
predicts  this  result.  We  also  discuss  other  properties,  including  dislocations  at 
interfaces,  the  residual  donor,  worms,  surface  segregation  and  its  impact  on 
passivation,  and  concentration  fluctuations.  The  forces  driving  these  phenomena, 
where  they  are  known,  will  be  presented.  Most  of  the  paper  focuses  on  the 
following:  correlations;  native  defects,  formation  enthalpies  and  entropies:  native 
defect  equilibria  with  mercury  gas  and  with  tellurium  inclusions:  and  self-diffusion 
coefficient  activation  energies  including  its  contribution  from  migration  energies. 
We  will  take  advantage  of  new  first-principles,  high-accuracy  calculations  to  help 
explain  the  experimental  situation.  The  calculations  predict  that  the  main  native 
defects  found  in  alloys  equilibrated  at  low  Hg  pressures  are  Hg  vacancies,  while  at 
high  Hg  pressures  they  are  Hg  interstitials,  and,  surprisingly,  Hg  antisites. 


1.  Introduction 

As  we  have  been  told  in  this  conference  [1],  sophisticated 
focal  plane  arrays  (fpas)  of  sacs  as  large  as  256  x  256 
elements  have  been  demonstrated  on  Hgj_j,Cd,Te  ma¬ 
terials  grown  epitaxially  by  several  means  on  various 
substrates.  Control  of  the  material  and  its  processing  is 
reaching  a  stage  where  practical  yields  of  arrays  with 
adequate  performance  for  some  applications  are  nearly 
at  hand,  so  these  array  costs  are  becoming  acceptable. 
While  the  community  is  now  well  into  a  manufacturing 
phase,  there  remain  many  unanswered  questions  about 
the  nature  of  Hgi  _^Cd,Te  alloys,  questions  whose  reso¬ 
lution  would  improve  the  performance,  yield  and  stabili¬ 
ty  of  focal  plane  arrays.  The  purpose  of  this  paper  is  to 
review  the  current  status  of  our  knowledge  of  this  alloy 
system  [2,  3],  with  the  aim  of  suggesting  potential 
mechanisms  for  the  uncertain  properties  and  additional 
experiments  to  test  these  hypotheses.  We  also  will  dem¬ 
onstrate  the  merit  of  recently  developed  first-principles 
computational  methods  [4]  to  help  settle  outstanding 
problems  associated  with  the  material  which  will  even¬ 
tually  lead  to  accurate  processing  and  performance 
models.  Figure  1  depicts  schematically  the  major  issues 
that  will  be  addressed.  They  were  selected  because  of 
their  potential  impact  on  performance  of  fpas.  The  figure 
shows  a  passivated  HgCdTe  sample  on  a  substrate.  On 
the  right  is  a  graph  of  a  vertical  composition  profile 
through  one  region  of  the  sample,  and  on  the  left  is  a 

tThis  work  was  supported  in  part  by  NASA  Contract  NASI-I8226. 
ONR  Contract  N000I4-88-C-O096.  and  AFOSR  Contract  49620-88- 
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blow-up  of  a  ‘microcluster’.  The  major  topics  identified 
are  point  defects  [2],  conrelations  [3],  ‘worms’  [5],  misfit 
dioslocations  [6,  7],  substrates,  extended  defects  [1], 
inclusions  [7],  bulk  concentration  fluctuations  [8]  and 
surface  segregation  [9].  Other  topics  such  as  metalliza¬ 
tion  [10],  particularly  to  p-type  material,  could  have  also 
been  included  in  this  list,  but  we  limit  our  discussion  to 
those  mentioned  above.  We  begin  by  summarizing  below 
the  most  important  aspects  of  each  of  these  topics.  A 
selected  set  will  be  greatly  expanded  upon  in  later 
sections  of  the  paper. 

1.1.  Point  defects 

Most  Hg,_,Cd,Te  alloys  are  Te  rich  as-grown  [1,  2], 
The  excess  Te  is  accommodated  by  Hg  vacancies  (desig¬ 
nated  V„,)  or  as  Te  solid  inclusions  (Te,)  [7, 1 1].  The  Hg 
vacancies  are  shallow  acceptors  (possibly  negative-C/- 
centre  double  acceptors)  [12],  With  the  exception  of 
some  MBE  or  mocvd  material,  as-grown  material  is  p- 
type.  both  at  room  temperature  and  at  77  K  [13,  14], 
This  material  can  usually  be  annealed  at  low  temperature 
( ~  250  °C)  in  an  Hg  overpressure  to  fill  the  sites  and 
to  dissolve  the  Te  inclusions  [7],  When  this  is  done  the 
material  converts  to  n-type  with  carrier  concentrations 
that  are  remarkably  low,  from  a  few  times  10‘*  cm  "  *  to  a 
few  times  10*’  cm" ^  in  high-purity  material  [14],  The  n 
dopant(s)  (called  the  residual  donors)  are  unknown. 
There  is  evidence  [14]  that  the  principal  residual  donor  is 
an  impurity  rather  than  a  native  defect,  but  because  it  is 
so  pervasive  the  evidence  in  not  completely  convincing. 
The  residual  donor  concentration  is  unknown,  uncon- 
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FIgura  1.  Schematic  features  of  HgCdTa  alloys. 


trolled,  and  sets  fpa  limits  for  most  operating  modes. 
Identification  and  control  of  the  residual  donor  would 
undoubtedly  be  beneficial.  It  has  been  demonstrated  by 
diffusion  measurements  that  samples  annealed  in  higher 
Hg  pressures  contain  Hg  interstitials  (Hg,)  [15,  16],  but 
while  they  are  likely  to  be  donors,  the  electrical  state  of 
an  Hg,  has  never  been  directly  identified.  It  has  been 
speculated  that  Te  interstitials  (Te,)  and  antisites  (Tch,) 
[17]  are  active  recombination  centres,  but  direct  evi¬ 
dence  on  their  deep  states  is  not  available.  The  Hg 
antisite  (Hgr,)  has  not  been  previously  suggested  as  a 
high-concentration  defect,  but  we  will  present  arguments 
in  a  later  section  that  it  may  be  present  in  concentrations 
sufficient  to  ,Tect  devices  [2]. 

So  far  we  have  mentioned  only  native  defects,  but 
impurity  doping  is  now  being  brought  under  control  in 
HgCdTe  alloys.  It  has  proved  to  be  difficult  to  gain 
control  of  both  donor  and  acceptors,  but  methods  are 
now  in  hand.  Well  behaved  impurities  are  In  [18]  on 
cation  sites  serving  as  donors,  and  As  on  Te  sites  serving 
as  acceptors.  They  are  inserted  into  the  material  during 
epitaxial  growth.  Ion  implantation  and  diffusion  doping 
have  not  been  successful  enough  to  be  used  in  any 
manufacturing  process.  Partial  explanations  for  these 
observations  have  been  published,  but  refinements  are 
needed  to  make  them  quantitative.  Many  devices  are  still 
made  utilizing  V„,  acceptors  and  residual  donors  [19]. 


1,2.  Correlations 

It  is  now  well  recognized  that  semiconductor  alloys  of  the 
form  are  rarely  truly  random.  The  A  and  B 

atoms  do  not  occupy  their  sublattice  in  the  zinc  blende 
structure  at  random  but  are  somewhat  correlated,  so 
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there  is  a  degree  of  short-range  order,  and  in  some  cases 
even  long-range  order  [20,  21].  The  principal  driving 
terms  for  correlations  are  bond  length  and  chemical 
difference  between  the  constituents  AC  and  BC.  In 
Hg,.„Cd,Te  the  bond  lengths  of  HgTe  and  CdTe  are 
nearly  equal,  and  the  differences  in  their  chemical  terms 
are  also  small,  so  that  correlations  are  predicted  to  be 
small.  However,  a  high  degree  of  correlation  has  been 
deduced  using  five  different  experimental  techniques  (Te 
nuclear  magnetic  resonance  (nmr)  [22],  Raman  spec¬ 
troscopy  [23,  24],  infrared  (ir)  reflection  spectroscopy 
[25]  and  x-ray  diffraction  [26]).  In  the  concentration 
range  near  x  =  0.25,  the  correlations  are  such  that  the 
material  is  tending  toward  an  ordered  structure  rather 
than  spinodal  decomposition.  Those  findings  may  have 
important  consequences  for  devices,  in  particular  if  the 
correlation  state  varies  spatially.  Such  correlation  fluc¬ 
tuations  may  be  a  source  of  spatial  variability  of  material 
properties  that  occurs  even  if  the  composition  is  uniform, 
with  the  variation  affecting  the  uniformity  of  the  band- 
gap,  native  defects  and  impurity  concentration,  mobili¬ 
ties,  etc.  The  magnitudes  of  such  variations  may  differ 
greatly  from  one  property  to  another.  For  example,  one 
would  expect  relatively  slight  bandgap  variations,  but 
rather  more  substantial  differences  in  vacancy  concentra¬ 
tions  and  diffusion  coefficients  between  regions  of  differ¬ 
ent  correlation  states.  This  occurs  because  band 
structures  are  properties  that  depend  on  site  occupations 
averaged  over  many  lattice  spacings,  but  vacancy  for¬ 
mation  energies  depend  sensitively  on  the  local  atom 
ar'angement  around  the  vacant  site.  Both  the  experi- 
Hicntal  and  theoretical  understanding  of  the  correlation 
state  of  HgCdTe  need  to  be  clarified.  A  more  detailed 
exposition  of  this  problem  is  presented  in  section  2. 
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U.  ‘Worms’ 

Material  grown  by  lpe  and  movpe  and  annealed  to  n 
type  often  exhibits  low  carrier  concentrations  ~  10*’ 
cm  ■  *  but  with  electron  mobilities  far  below  t  hose  of  bulk 
or  MBE-grown  matenals.  If  these  materials  are  intention¬ 
ally  impurity  doped  to  be  slightly  more  n  type,  their 
mobilities  return  to  the  high  values  obtained  by  other 
growth  methods.  The  currently  accepted,  although  un¬ 
confirmed,  explanation  for  this  phenomenon  is  that  after 
the  Hg  overpressure  anneal,  there  are  ‘woodworm-like’ 
domains  in  the  n-type  material  that  remain  p  type  [S], 
These  regions  produce  p-n  junctions,  and  mobilities 
measured  in  a  Hall  measurement  exhibit  a  mixed  effective 
mobility  that  appears  low.  impurity  doping  converts 
these  p-type  worms  to  n  type,  so  while  there  is  still  a 
spatial  variation  of  the  doping  concentrations,  all  the 
material  is  n  type  so  the  measured  Hall  mobility  is  higher. 
From  the  viewpoint  of  device  limits,  such  impurity 
doping  has  two  potential  deleterious  effects.  First  the 
minimum  carrier  concentration  that  can  be  used  is 
increased,  and  second  there  is  a  spatial  variation  of  the 
carrier  concentration. 

The  worms  are  an  appealing  explanation  for  the 
observed  transport  properties,  but  if  the  goal  is  to 
improve  devices  made  from  lpe  material  then  we  must 
understand  why  the  worms  form  and  how  to  eliminate 
them  during  growth,  or  in  subsequent  processing.  Do  the 
worms  form  because  they  conespond  to  regions  of  low 
residual  donors,  or  to  regions  where  the  V„,  formation 
energies  are  small  so  that  they  do  not  anneal  completely? 
Are  the  worms  regions  where  the  correlation  state  differs 
from  the  remaining  material?  Why  are  worms  more 
prevalent  in  lpe  or  movpe  material  than  in  higher- 
growth-temperature  bulk  material  or  lower-growth-tem- 
perature  mbe  material?  These  and  other  questions  are 
still  unanswered. 

1.4.  Misfit  dislocations 

Substantial  misfit  dislocation  densities  are  alv  ;.'s  ^ound 
at  the  interface  between  Hgi_j,Cd,Te  and  substrate 
materials,  e.g.  Cd,  _^n,Te,  even  when  the  two  materials 
are  perfectly  lattice  matched  [6,  7],  This  situation  also 
occurs  to  a  lesser  but  still  significant  extent  at  heterojunc¬ 
tions.  The  reasons  for  these  anomalous  dislocation 
densities  remain  to  be  determined.  Because  dislocations 
have  been  demonstrated  to  have  a  deleterious  impact  on 
minority  carrier  lifetimes,  their  effect  is  of  particular 
concern  at  heterojunctions  where  they  degrade  RqA 
products  [27-29], 

1.5.  Substrates 

This  topic  has  been  extensively  reviewed  in  other  confer¬ 
ence  papers  [1].  The  ideal  substrate  material  would  be  a 
large-area,  iR-transparent  insulator  that  is  matched  to 
the  active  layer  by  lattice  constant,  thermal  expansion 
coefficient  and  chemical  potential,  and  on  which  signal 
processing  circuitry  could  be  integrated.  This  idealization 


does  not  exist  for  Hg,  _,Cd,Te,  so  compromises  must  be 
made.  The  materials  CdZnTe,  and  GaAs  or  Si  with  buffer 
layers,  are  the  current  leading  contenders. 

1.6.  Extended  defects 

A  variety  of  extended  defects,  grain  boundanes,  anti¬ 
phase  boundaries,  twin  planes,  threading  dislocations, 
stnations,  etc  have  all  been  directly  observed  in  transmis¬ 
sion  electron  microscope  (tem )  studies  or  their  existence 
deduced  from  indirect  evidence.  Their  impact  on  device 
characteristics  is  not  well  established,  but  is  certainly 
never  helpful.  Because  large-area  fpas  are  being  made, 
extended  defects  obviously  are  relatively  dilute  or  ineffec¬ 
tive  in  current  device-grade  material.  However,  improve¬ 
ments  are  still  possible. 

1.7.  Inclusions 

We  have  already  mentioned  Te  inclusions,  and  they  will 
be  analysed  more  extensively  in  later  sections  of  this 
paper.  They  always  have  an  array  of  dislocations  around 
them  [7],  are  thought  to  serve  as  getters  for  some 
impurities,  and,  as  we  suggest  later,  are  likely  to  have  a 
non-equilibrium  atmosphere  of  and  Te„,  around 
them  [2].  When  they  are  dissolved  in  the  Hg  anneal, 
some  of  their  associated  dislocations  climb  to  surfaces 
[7]  and  are  eliminated,  although  some  remain;  the 
gettered  impurities  may  remain  as  inclusions,  and  the 
larger  Te,  inclusions  may  not  be  completely  dissolved. 
Means  to  avoid  Te,  inclusion  formation  in  as-grown 
material  are  most  desirable. 

1.8.  Bulk  concentration  fluctuations 

Even  if  an  alloy  is  random,  any  segment  in  space  has  a 
Bernoulli  probability  concentration  distribution  [30]. 
Thus  one  expects  small-scale  concentration  fluctuations 
if  there  is  nothing  to  suppress  them.  Because  of  the  near 
lattice  match  between  HgTe  and  CdTe.  a  region  of  space 
with  a  concentration  that  differs  significantly  from  the 
average  produces  no  long-range  strain  field  and  is  there¬ 
for*  not  strain  suppressed.  The  same  effect  has  been 
found  in  the  lattice-matched  alloy  Al,_„GajAs 
[31],  However,  in  a  lattice-mismatched  material  like 
Hgi_j,Zn,Te.  such  short-range  concentration  fluctua¬ 
tions  are  suppressed.  One  consequence  of  this  is  that  in 
HgCdTe  the  exciton  line  is  very  broad,  while  in  HgZnTe 
it  is  narrow  [8].  This  effect  should  be  particularly 
troublesome  in  vlwir  material  where  the  bandgap  is  very 
small  and  fluctuations  could  produce  semi-metal  short¬ 
ing  domains. 

1.9.  Surface  segregation 

rn  e,ci'.erai,  an  alloy  in  equilibrium  will  not  have  a 
uniform  concentration  near  an  interface  [9].  This  is  most 
easily  understood  at  a  vacuum  surface  where  there  are 
dangling  bonds.  The  material  can  minimize  its  free 
energy  by  having  the  alloy  species  that  makes  the  weak¬ 
est  bonds  concentrate  at  the  surface.  Also  in  lattice- 
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mismatched  alloys  the  low -concentration  species  will  be 
driven  to  the  surface  by  ‘strain  release’  because  they 
expenence  less  strain  energy  at  the  surface  than  in  the 
bulk.  In  the  near-lattice-matched  Hg,-,Cd,Te  alloy, 
strain  release  plays  little  part,  and  therefore  surfaces  are 
Hg  rich,  because  Hg  maxes  weaker  bonds  to  Te  than 
does  Cd.  Detailed  predictions  are  that  for  x  <  0.4  the 
surface  layer  will  have  concentrations  in  the  semi-metal 
range  returning  to  the  bulk  concentration  within  a  few 
atomic  layers.  Thus,  in  equilibrium,  mwir  and  lwir 
HgCdTe  alloys  will  have  Hg-rich  semi-metal  surfaces.  A 
CdTe-passivated  HgCdTe  alloy  will  have  a  uniform 
concentration  up  to  the  interface  because,  both  in  the 
bulk  and  at  the  interface,  cations  will  be  surrounded  by 
four  Te  atoms,  so  there  is  no  driving  force  for  segregation. 
There  is  also  little  dr  ving  force  for  segregation  at  a  ZnTe 
interface  with  HgCdTe.  These  conclusions  are  true  only  if 
interdiffusion  is  prohibited  across  the  interface. 

Because  there  is  less  tendency  for  interdiffusion  be¬ 
tween  ZnTe  and  Hgi_,Cd,Te  than  between  CdTc  and 
the  alloy,  we  have  suggested  that  a  few  atomic  layers  of 
ZnTe  followed  by  a  thicker  CdTe  layer  should  be  a 
superior  passivant. 

A  CdS/HgCdTe  interface  should  be  Cd  rich  because 
the  CdS  bond  strength  exceeds  that  of  HgS  by  more  than 
that  between  CdTe  and  HgTe.  A  similar  argument  ap¬ 
plies  to  oxides,  except  that  the  lattice-constant  mismatch 
becomes  so  large  that  dislocations  and  grain  boundaries 
begin  to  exert  a  much  larger  influence  on  the  net  result. 

The  rapid  return  of  the  interface  concentration  to  its 
bulk  value  (a  few  atomic  layers)  is  predicted  to  occur  only 
above  an  alloy’s  order-disorder  transition  temperatures 
[3,  32],  Below  this  temperature,  long-range  large-excur¬ 
sion  concentration  oscillations  should  occur.  Because  of 
its  expected  low  order-disorder  temperature,  one  would 
tend  to  discount  such  phenomena  from  consideration  in 
HgCdTe.  However,  the  observations  of  large  correlation 
effects  makes  us  pause.  If  these  effects  are  confirmed,  then 
there  may  be  comparatively  long-range  concentration 
fluctuations  adjacent  to  an  epitaxial  layer-substrate  in¬ 
terface,  which  may  help  to  explain  the  anomalous  misfit 
dislocation  densities  that  are  observed.  This  suggestion 
(first  made  by  Spicer  et  al  [10],  but  without  a  mechanism 
identified)  is  highly  speculative,  and  while  concentration 
oscillations  driven  by  surface  segregation  and  interface 
boundary  conditions  are  a  natural  consequence  of  a 
higher  than  expected  order-disorder  transition  tempera¬ 
ture,  there  is  no  direct  evidence  to  support  its  occurrence 
at  a  HgCdTe/CdTe  interface.  Relevant  experiments 
would  be  helpful. 

The  remainder  of  this  papier  will  be  devoted  to  an 
exposition  of  our  current  understanding  of  correlations, 
native  defects  and  their  diffusion. 


2.  Review  of  correlations 

We  [21]  and  others  [33-35]  have  demonstrated  that 
there  are  always  correlations  in  alloy  semiconductors. 
Here  we  summarize  our  previous  work,  focusing  atten- 
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tion  on  those  aspiects  that  may  help  us  to  identify  what 
may  be  missing  from  the  interactions  dnving  correlations 
in  HgCdTe.  We  consider  an  average  population  Xj  of 
five-atom  anion-centred  clusters  including  16  bonds 
(most  others  treat  four  bonds  only).  The  population  of 
five-atom  clusters  of  the  form  A- ..  B,  C  can  be  shown  to 
be  grand  canonical  ensembles  of  the  form 

Xj  =  gj  exp[(/ig/ij  -  Cj)/kn/q({clJ]],  mb)  ( 1 ) 

where  the  partition  function  is 

j 

Ha)  ^  'L  9i  exp[(/iB";  -  £j)/<cri  (2) 
1-0 

and  is  the  degeneracy  of  cluster  j  =  0, 1 . J  J  < 

2*),  rtj  is  the  number  of  B  atoms  on  cluster  j,  e.j  is  the 
excess  energy  of  cluster  j  relative  to  the  virtual  crystal 
approximation  (vca)  average  Evca  =  (1  ~  *)£a  + 
(called  Ay  by  Sher  et  al  [21]),  and  Eg  are  cluster 
energies  of  the  pure  AC  and  BC  compiounds.  k  is  the 
Boltzmann  constant  and  Tis  the  absolute  tempierature. 

If  the  clusters  do  not  have  their  normal  counting 
degeneracies 


split,  then  we  have  gj  =  g°  the  binomial  coefficient,  and 
i  =  4.  The  chemical  ]>otcntial  is  set  by  insisting  that  the 
composition  x  is  correct 

4x  =  ^  Iiy  Xj.  (3) 

J 

If  one  makes  a  transformation  to  the  reduced  excess- 
energies  representation 

Ay  =  £y-(^1  -  (4) 

(called  A}  by  Sher  et  al  [21])  then  Xj  becomes 

Xj  =  gj  cxp[(pi}nB  -  ^j)kT]/q({^j},  /ig)  (5) 

where 

Atg  =  /tg  —  (Ej  —  £o)/^- 

Note  that  we  have  Aq  =  Ay  =  0,  so  at  least  two 
members  of  the  set  {A,}  vanish.  More  impxirtantly,  only 
the  reduced  excess  energies  actually  drive  the  cluster 
populations.  We  shall  sec  shortly  that  the  energies  of  the 
{Ay}  set  are  often  five  to  ten  times  smaller  than  those  of 
the  {cy}  set.  Most  of  the  strain  contribution  to  the  {cy}  set 
subtracts  out  and  contributes  little  to  the  Xy  values. 

There  is  an  additional  effect.  It  can  best  be  appre¬ 
ciated  when  stated  as  a  theorem:  If 


then  J  =  4  and 

Xy  =  xf  =  g^(l  -  xy*-‘>x"<  (7) 

where  x°  are  the  piopulations  of  a  random  alloy.  Thus,  no 
matter  how  large  the  constant  S  may  be,  as  long  as  the 
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energies  Ay  vary  linearly  with  the  number  of  B  atoms  on 
the  cluster  rij  and  the  counting  degeneracy  is  not  split,  the 
alloy  IS  random.  Strain  energies  from  bond-length  differ¬ 
ences  between  the  constituents,  chemical  energies  and 
electron-electron  interactions  all  have  some  nonlinear 
variations  with  rij.  The  degeneracy  g°  is  split  by  coherent 
strains,  temperature  gradients,  or  anything  that  estab¬ 
lishes  preferred  directions  for  the  locations  of  A  and  B 
atoms  on  a  cluster.  These  splittings  always  establish  a 
preference  for  one  type  of  cluster  in  a  particular  orienta¬ 
tion  [e.g.  for  a  strain  in  the  (1 1 1)  direction,  AjB  with  the 
B  oriented  along  the  (111)  axis],  and  therefore  drive 
deviations  of  the  {xy}  set  from  {xj}  toward  compound 
formation,  and,  if  the  splittings  are  large  enough,  toward 
long-range  order.  The  effect  of  the  energies  {Ay}  can  drive 
the  {xy}  set  toward  either  compound  formation  or  spino- 
dal  decomposition,  depending  on  details. 

The  excess  free  energy  of  an  Aj.^B.C  alloy  can  be 
written  as 

Af  =  A£  -  TAS  (8) 

where  the  excess  energy  is 

A£  =  AfX£jiy  (9) 

J 

with  M  being  the  number  of  clusters.  The  excess  entropy 
is 

with  My  -  XyM  (see  Sher  et  al  [21]  for  the  justification). 
The  total  number  of  clusters  M  is  related  to  the  number 
of  Bravais  lattice  sites  N  in  the  crystal  by  M  =  (V/4  for  a 
16-bond  cluster  (and  M  =  Af  for  a  four-bond  cluster). 
Equations  (9)  and  (10)  can  be  rewritten  for  the  16-bond 
cluster  as 

A£  =  ^^(1  -  X)  £(,-(-  X£y  -H  (11) 

and 

AS  =  —  k)V^(l  —  x)ln(l  —  x)  X  In  X 

M  _  \ 

+  Inxy-xylnx}')! 

=  Njx  +  ^lnq({^,},n'a).  (12) 

Note  in  the  first  equality  in  equation  (12)  that  M/N  is 
1  for  a  four-bond  and  1/4  for  a  16-bond  cluster.  In  the  16- 
bond  cluster  case,  there  is,  as  a  consequence,  an  exact 
cancellation  of  the  (1  —  x)  In  (1  —  x)  -)-  x  In  x  term  by  a 
portion  of  the  third  term  in  the  bracket.  This  occurs 
physically  because,  in  the  16-bond  cluster,  all  the  bonds 
associated  with  a  given  substituted  atom  are  in  the 
cluster.  In  the  four-bond  case,  however,  bonds  from  each 
substituted  atom  contribute  to  four  different  clusters. 
Several  features  of  equations  (11)  and  (12)  are  note¬ 
worthy.  First,  £o  and  Cj  are  functions  of  x  but  arc 


temperature  independent.  For  bond-length-mismatched 
alloys,  they  vary  with  x  roughly  as  £o  %  £Sjc*  and  Ej  %  eJ 
(1  —  x)^,  so  that  we  have 

A£  ;=  x(  1  -  X)  -K  X  Vy j  ( 1 3) 

where  the  mixing  enthalpy  parameter,  fi  is  approxi¬ 
mately 

n  [£?  +  (£?- £?)x]/4  (14) 

and  nearly  x  independent  if  e^  %  £j,  as  is  often  the  case. 
The  second  term  in  equation  (13)  is  usually  small  com¬ 
pared  with  the  first  and  contains  all  the  temperature 
dependence.  However,  when  equations  (5),  (12)  and  (13) 
are  inserted  into  equation  (8)  to  obtain  A£,  a  term  in  TAS 
exactly  cancels  the  second  term  in  equation  (13).  Thus,  it 
is  impossible  to  determine  the  temperature  variation  of 
A£  from  a  measurement  of  A£.  The  expression  for  Af 
becomes 

Af  =  N[flx(l  -x)-t-x/rB-ilcrin4({Ay},  Mb)].  (15) 

The  chemical  potential  Mb  is  determined  from  equa¬ 
tion  (3),  and  both  mb  and  q  are  generally  temperature 
dependent. 

We  have  calculated  {ej},  (Ay),  mb.  {xj  -  xJ},  ^E(T) 
and  Af(D  for  Hg,  _,Cd,Te  (figure  2)  and  Hg,  .^n,Tc 
(figure  3)  alloys.  The  energies  £y  for  these  figures  were 
calculated  as  discussed  in  detail  in  [21].  The  16-bond 
cluster  was  attached  to  a  rigid  medium  at  the  virtual 
crystal  positions  in  the  third  shell  from  the  cluster  centre. 
The  atom  positions  inside  the  third  shcU  were  adjusted  to 
minimize  the  strain  and  chemical  energies.  The  energy  £y 
is  the  total  energy  of  the  16  bonds  with  the  atoms  in  their 
minimum-energy  positions.  The  variations  of  atom  bond 
lengths  in  different  alloys  are  well  predicted  by  this 
method.  The  Af  curves  for  Hgi_,Cd,Te  show  no  tenden¬ 
cy  for  an  order-disorder  transition  in  the  temperature 
range  studied,  but  Xy  —  x®  values  do  deviate  from  zero. 
Hg,  .jjZn^Te  does  have  an  order-disorder  spinodal  tran¬ 
sition  with  critical  [36]  temperature  s  380  K.  Al¬ 
though  Hg,  _„Cd,Te  shows  significant  deviations  of  the 
Xy  from  random  alloy  values  for  a  sample  equilibrated  at 
300  K,  they  are  still  not  as  large  as  those  measured  in  the 
NMR  experiments,  meaning  that  a  major  interaction  may 
be  missing.  However,  at  300  K  diffusion  may  be  fast 
enough  in  HgCdTe  for  the  material  to  equilibrate  even  at 
this  low  temperature  (see  the  discussion  in  the  next 
section).  The  slower  diffusion  observed  in  HgZnTe  may 
be  a  distinct  advantage,  one  that  counters  its  above- 
room-temperature  critical  temperature. 

We  have  shown  that  only  the  small  (Ay)  drive  devia¬ 
tions  in  the  populations  x,  away  from  randomness. 
Moreover,  if  Ay  =  rtjd  is  linear  in  rij  the  number  of  B 
atoms,  then  despite  the  size  of  S,  the  populations  are  still 
random.  Thus,  effects  that  would  otherwise  be  considered 
small  may  compete  with  the  larger  energies  retained  if 
they  have  the  proper  nonlinearity.  We  are  examining 
several  possibilities  in  Hg,  _,,Cd„Te.  These  include  effects 
caused  by  screening  in  the  composition  range  where  the 
alloy  is  a  semi-metal,  direct  second-neighbour  chemical 
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Figure  2.  Composition  variation  of  correlation-state-dependent  quantities  tor  Hg,  _,Cd.Te. 


interactions  treated  in  the  context  of  Harrison’s  model 
[37],  electron-electron  Coulomb  interactions  driven  by 
polarity  differences  between  the  constituents,  and  com¬ 
position  variations  of  the  elastic  constants. 


dure.  Because  we  do  not  have  access  to  the  original  data 
we  have  corrected  published  numbers  by  assuming  the 
straight-line  fits  of  In  G  versus  l/T  were  made  through 
the  points  Tj  and  T*  given  by 


3.  Nativa  dafacts  and  diffusion 

Tj-'  =  i(2rr'  +  Tj-*) 

T;‘  =  i(rr'  +  2r2-‘) 

Then  the  correct  expression 

(17) 

3.1.  Data  fitting 

G  =  C(P)T"exp(-£/kD 

(18) 

There  have  been  several  measurements  [15, 16]  of  the  Hg 
vacancy  formation  energy  and  Hg,  Cd  and  Te  tracer 
diffusion  coefficients.  These  measurements  are  always 
fitted  to  a  functional  form 

G  =  GJP)exp(- E'/k-n  (16) 

where  £'  is  interpreted  as  the  activation  energy  and 
G„(P)  is  the  infinite  temperature  limit  of  G.  For  vacancy 
formation,  G  represents  the  vacancy  concentration,  and 
for  diffusion  the  infinite  temperature  diffusion  coefficient. 
It  is  assumed  in  the  fitting  process  that  G„(P)  can  be 
viewed  as  a  function  only  of  the  partial  pressure  P  above 
the  sample  of  the  species  under  investigation.  However, 
the  pre-exponential  coefficient  always  has  a  power-law 
temperature  contribution,  i.e.  a  T"  multiplicative  com¬ 
ponent.  Because  the  measurements  extend  over  a  limited 
temperature  range,  Tj  ^  Tj,  usually  have  some  scat¬ 
ter,  and  the  7’"’  term  varies  more  slowly  than  the  expon¬ 
ential,  reasonable  fits  to  the  data  are  obtained.  However, 
when  £'  values  are  quoted  to  more  than  one  significant 
figure,  and  G„  to  the  proper  order  of  magnitude,  it  is 
important  to  include  the  T"  factor  in  the  fitting  proce- 


parameters  are  related  to  the  fitted  parameters  by  the 
relations 


and 


£  =  £'  —  mkr 


where 


m  CTl 


r  =  (Tj-  ‘  +  HTJT^)  S  r*. 


(19) 

(20) 

(21) 


The  approximate  expression  to  r  in  equation  (21)  is 
accurate  if  T*  —  T3  <1  T*.  Note  that  if  m  is  positive  £  <  £' 
and  if  m  is  negative  £  >  £'. 

To  illustrate  the  effect  of  these  corrections,  we  have 
examined  the  case  of  Vh,  formation  and  diffusion  and  the 
interstitial  Hg,  diffusion  in  Hgo  78Cdo.22Te.  The  mass 
action  equation  for  formation  of 

HgTe«>V„,Te  +  Hg,  (22) 

where  Hg  represents  a  Hg  atom  on  a  Hg  site,  and  Hg, 
represents  Hg  in  the  vapour  phase,  has  been  analysed 
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Figure  3.  Composition  variation  of  correlation-state-dependent  quantities  for  Hg,  _^n.Te. 


using  standard  results  in  statistical  mechanics.  The  diffu¬ 
sion  coefficient  D  is  given  by 


where 

7  =  (24) 

and  where  a  is  the  hop  distance,  is  the  mean  free  time 
between  hops,  n/N  is  the  fraction  of  lattice  sites  N 
occupied  by  vacancies,  is  the  presentation  frequency 
of  an  atom  at  the  barrier  between  it  and  an  adjacent 
vacant  site  and /,.  is  the  probability  that  an  adjacent  atom 
reaches  the  intermediate  position  at  the  peak  of  the 
barrier  between  it  and  an  adjacent  vacancy.  (The  prime  is 
inserted  to  remind  us  that  this  is  not  a  normal  interstitial 
because  it  lies  between  two  vacant  sites.) 


The  quantity  /;•  is  calculated  roughly  from  the  mass 
action  equation  for  the  effective  reaction 

HgTe  +  V„„N>V„,Te  +  Hg,  (25) 

again  remembering  that  the  HgTe  is  adjacent  to  a  V„,  in 
the  initial  state. 

Then,  the  assumption  that  the  Hg,  in  the  interme¬ 
diate  position  does  not  remain  long  enough  for  its 
vibrational  states  to  thermally  equilibrate  leads  to  the 
power-law  m  values  quoted  in  table  1. 

There  are  several  noteworthy  features  to  the  informa¬ 
tion  in  table  1.  First,  the  corrections  to  the  activation 
energies  are  10  to  20  %,  which  is  important  if  the  second 
and  third  significant  figures  quoted  are  to  be  taken 
seriously.  Second,  note  that  for  both  vacancy  and  inter¬ 
stitial  diffusion  in  the  Chen  [16]  and  in  the  Tang  and 
Stevenson  [15]  work  a  larger  activation  energy  corre¬ 
lates  to  a  larger  pre-exponential  factor.  This  occurs 


Table  1.  Corrections  to  experimental  activation  energies  and  pre-exponential  coefficients. 


Experiment 

£'(eV) 

(cm=  s-’) 

7-,  (K) 

T^{K) 

T 

£(eV) 

C 

m 

Vacancy  formation 

Vydyanath  [13,  14] 

2.24 

400 

650 

495 

2.01 

11/2 

Vacancy  diffusion 

Chen  [16] 

2.40 

4.7  X  10®  P„- ’ 

400 

490 

441 

2.08 

3.2  X  10-*’  P„-'t 

17/2 

Tang  and  Stevenson  [15] 

2.10 

4.87  X  10*  P„-’ 

350 

500 

412 

1.80 

5.8  X  10-“P„-'t 

17/2 

Interstitial  diffusion 

Chen  [16] 

0.54 

1.1  X  10- 

400 

490 

441 

0.64 

5.5  X  10-’®P„,t 

-5/2 

Tang  and  Stevenson  [15] 

0.61 

5.5  X  10- "P^^ 

350 

500 

412 

0.70 

2.4  X  io-’®p„,: 

-5/2 

t(cm*s-’)-  r’"* 
Jem  -  r*^s-’. 
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because  of  the  T*  dependence  and  the  fact  that  the  two 
experiments  were  conducted  over  different  temperature 
intervals  T,  and  T^- 

The  corrected  numbers  for  the  two  groups  are  closer 
together  than  the  uncorrected  ones  in  both  cases,  though 
still  not  in  perfect  agreement.  If  we  average  the  corrected 
Hg  diffusion  activation  energies  of  the  two  experimental 
groups  we  get 

to,v„.=  1.94^0.2  eV 

£o,„„  =0.67  ±0.04eV 

3.2.  Native  defects 

Much  of  the  following  discussion  has  appeared  pre¬ 
viously  in  [2].  As  mentioned  earlier  the  doubly  ionized 
cation  vacancy  is  believed  to  be  the  dominant  native 
defect  in  HgCdTe  that  dictates  the  electrical  behaviour  of 
the  undoped  material  [13, 14].  We  have  recently  reported 
the  results  of  calculations  of  native  and  impurity  defect 
total  energies  in  HgTe,  CdTe  and  ZnTe  [18].  The  ener¬ 
gies  for  the  formation  of  various  native  point  defects  in 
HgTe  are  summarized  in  table  2;  the  HgTe  solid  and  the 
Hg  in  the  vapour  are  used  as  reference  energiest.  These 
calculations  used  the  linearized  muffin-tin  orbital  (lmto) 
method  within  the  local  density  approximation  (loa)  to 
the  exchange  correlation  energy.  Large  supercells  con¬ 
taining  one  defect  per  cell  were  repeated  periodically,  and 
from  the  difference  in  total  energies  pc  cell,  with 
and  without  the  defect,  the  defect  formation  energy  was 
calculated.  To  expedite  the  calculations,  the  energies 
were  calculated  within  the  atomic  spheres  approximation 
(asa)  with  a  small  (spd)  basis  set.  In  the  asa,  an  approxi¬ 
mation  to  the  exact  density  functional  is  evaluated:  as  a 
result,  an  error  is  introduced  which  is  larger  than  in  other 
LDA  methods  [38],  and  relaxation  energies  cannot  be 
accurately  calculated.  Thus  only  those  differences  in 
energies  that  are  >  0.5  eV  here  should  be  viewed  as 
significant  for  these  calculations. 

An  appropriate  set  of  mass  action  constants  for  the 
neutral  defect  reactions  is  also  given  in  table  2.  The 

t  A  differem  reference  is  used  here  than  was  used  in  table  III  in  [18]. 
An  error  appears  in  that  table  owing  to  the  incorrect  use  of  an 
energy  of  2£„  per  unit  cell  rather  than  4£„.  The  defect  energies  in 
table  I  in  [18],  from  which  the  energies  m  table  III  were  denved,  are 
correct. 


notation  in  that  table  is  as  follows.  Ag  corresponds  to  an 
A  species  occupying  a  B  site,  where  1  corresponds  to  an 
interstitial  and  V  to  a  vacancy  No  subsenpt  on  a  species 
indicates  that  it  is  occupying  the  correct  lattice  site,  e  g 
Hg  =  HgH,.  Square  brackets  [  ]  refer  to  concentrations 
A  subsenpt  'g'  indicates  the  speaes  in  the  gaseous,  or 
vapour,  phase,  and  is  the  Hg  vapour  pressure.  Most  of 
the  reactions  in  table  2  involve  the  creation  or  destruc¬ 
tion  of  one  or  more  unit  cells  Because  the  resulting 
change  in  volume  is  accommodated  at  the  surface,  the 
change  in  the  number  of  unit  cells  will  enter  into  the 
determination  of  the  defect  equilibnum  through  the 
surface  entropy.  Additionally,  surface  preparation  and 
onentatjon  will  affect  the  surface  free  energy.  We  have 
assumed  for  the  present  that  such  surface  effects  are 
negligible,  i.e.  that  the  volume  expansions  and  contrac¬ 
tions  can  occur  with  negligible  changes  in  the  surface 
properties. 

To  complete  the  defect  equilibrium  determination 
correctly  we  must  include  the  equilibration  of  the  elec¬ 
tronic  charges  of  the  system.  To  do  so  we  must  have 
knowledge  of  the  dominant  charge  states  of  the  defects 
and  their  activation  energies  with  respect  to  the  neutral 
defect.  Such  calculations  are  complicated  by  the  fact  that 
most  ab  miiio  calculations  of  the  electronic  band  structure 
of  semiconductors  predict  an  incorrect  bandgap,  Eq,  a 
shortcoming  of  the  local  density  approximation  (lda). 
Therefore,  we  shall  focus  on  the  neutral  native  defects 
here,  and  the  established  or  expected  charge  states  of 
these  defects. 

In  wtde-bandgap  materials  the  equilibration  of  de¬ 
fects  can  be  substantially  affected  by  the  Fermi  level;  for 
example  the  formation  energy  of  a  donor  will  decrease 
when  the  Fermi  energy  is  near  the  valence  band  edge, 
since  the  donor  electron  can  drop  into  a  vacant  state  near 
the  valence  band,  thereby  lowenng  the  energy  by  Eq 
Because  we  are  discussing  HgCdTe  with  a  narrow  band- 
gap.  we  expect  the  Fermi  effects  to  be  small,  but  not 
insignificant  at  high  temperatures.  Because  of  the  small 
conduction  band  effective  mass,  in  n-type  material  the 
filling  of  the  conduction  band  states  by  electrons  can  shift 
the  Fermi  energy  significantly.  Combined  with  the  in¬ 
crease  in  the  bandgap  for  the  high  temperatures  at  which 
most  defect  studies  are  done,  the  effective  bandgap  can 
be  substantially  larger  than  the  usual  77  K  bandgap 
associated  with  a  given  concentration  of  HgCdTe. 


Table  2.  Detect  reactions  and  tormation  energies. 


Defect  reaction 

Defection  concentration 

Energy  (eV) 

+  HgTe-V„,Te  -i-  Hg, 

2.01  + 

£r^  +  2HgTe«Te„,Te  -i-  2Hg, 

[TeH,!  =  exp(- Ej^IkT) 

4.53 

Et*  +  HgTe»-Te,XeV  Hg, 
fv,.  +  Hg,«HgV,.  \ 

[fe,!  =  expi-EjJkT) 

4.96 

IVt.I  =  Phb<,.  exp(-  E^JkT) 

3.12: 

Eh.,.  +  2Hg,  — HgHgT,'^ 

[Hgr.]  =  exp(  - 

-0.42 

Eh.,  +  HQg  —  Hg, 

lHg,l  =  P„,K®„  exp(  -  £„„/kr) 

0.84,  0.98 

t  Corrected  experimental  number  from  Vydyanath  (13,  14) 
^Calculated  using  a  tight-binding  Hamiltonian  [17]. 
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First  we  consider  the  defects  which  accommodate 
excess  tellurium  — the  first  three  defects  in  table  2.  The 
mass  action  constants  are  given  by  a  product  of  the  form 

K,  = /(:"exp(- A£,/fer).  (26) 

For  the  first  three  aquations  the  K°  are  given 
by 

=  Co(/£71*'2(2;r/n„,)^'^/i’^  exp(ASv„,/<c)  (27) 
^¥ch.  =  C„(fcD>(27rm„,)^)i-"  exp(AST.„./fc)  (28) 

and 

^Te.  =  Co(/c70’'^(27tm„,)^'^)i'^  exp(AST.,/fe)  (29) 

Here  T  is  the  temperature  in  keivin,  k  is  Boltzmann’s 
constant,  is  the  mass  of  the  mercury  vapour  atoms,  h 
is  Planck's  constant,  is  the  change  in  vibrational 
entropy  upon  formation  of  the  defect  and  Cq  converts 
from  site  fraction  to  volume  concentrations.  Estimates, 
valid  at  high  temperature,  of  the  temperature  variation  of 
the  ASj  terms  weie  included  in  the  pre-exponential  de¬ 
pendence  T”  of  the  reaction  constants  in  table  1.  Because 
two  unit  cells  of  HgTe  are  destroyed  when  a  tellurium 
antisite  is  created,  compared  with  one  unit  cell  when  a 
mercury  vacancy  is  created,  we  do  not  expect  that 
exp[(AST.„,  -  ASv„,)/^Tl  *  1.  While  we  have  not  com¬ 
pleted  the  evaluation  of  these  entropy  terms,  our  preli¬ 
minary  estimates  indicate  that  this  ratio  is  ~  10*.  For  the 
tellurium  interstitial  and  the  mercury  vacancy,  we  expect 
that  expClASj,,  —  ASv„,)/kT]  ^  1  will  be  correct  within 
a  factor  of  10.  Evaluating  the  numerical  constants  we  find 


_  lO-lO 

[Vh.]  ■ 

(30) 

i  10" 

[V„.] 

(31) 

for  r  =  500  ’C  and  =  1  atm.  The  conclusion  from 
equations  (30)  and  (31>  that  the  mercury  vacancy  is  the 
dominant  native  defect  is  consistent  with  experimental 
observation.  This  conclusion  is  unchanged  if  we  include 
the  possibility  that  the  species  may  be  ionized  at  the 
equilibration  temperature  where  the  material  is  expected 
to  be  intrinsic.  Although  the  tellurium  antisite  density 
decreases  more  rapidly  with  decreasing  Hg  pressure  than 
does  the  mercury  vacancy  density,  the  point  at  which  the 
concentrations  are  comparable  is  at  less  than  P„,  = 
10" atm,  and  certainly  the  HgTe  phase  boundary  is 
reached  before  such  low  Hg  pressures  can  be  achieved. 
This  is  also  consistent  with  the  fact  that  no  p-to-n 
conversion  is  observed  in  isothermal  anneals  for  low 
mercury  pressures  [13,  14,  38],  as  would  be  expected  if 
tellurium  antisites  became  the  majority  native  defect. 
Because  the  pressure  dependences  of  the  tellurium  inter¬ 
stitial  and  the  mercury  vacancy  concentrations  are  the 
same,  the  above  conclusions  will  hold  independent  of  the 
mercury  pressure. 

We  have  checked  the  sensitivity  of  the  calculated 
concentration  ratios  to  the  magnitude  of  the  reaction 
enthalpy.  Because  the  enthalpies  enter  the  exponents, 
small  changes  in  the  enthalpies  will  result  in  large 


changes  in  the  predicted  defect  concentrations.  For  ex¬ 
ample.  let  us  assume  that  our  calculated  antisite  forma¬ 
tion  enthalpy  is  in  error  by  0.5  eV :  in  this  case  the  ratio  of 
antisite-to-vacancy  concentrations  (at  500  “O  will  be 
reduced  to  ~  1  x  10"“.  For  an  antisite  formation  cn- 
tnalpy  in  error  by  1.0  eV.  this  ratio  is  reduced  to  ^  1  x 
10"  '.  We  do  not  expect  the  asa  errors  ,j  exceed  0.5  eV 

[5]. 

If  the  HgCdTe  is  not  completely  annealed,  and  tellur¬ 
ium  precipitates  are  still  present,  the  defect  equilibrium 
will  not  be  that  predicted  by  the  mass  action  equations 
given  in  table  2.  For  example,  near  the  inclusions  we  can 
assume  that  the  defects  will  be  nearly  in  equilibnum  with 
the  tellurium  solid;  thus 


-f  Te.-VH,Te 

(32) 

2Te,«Te„,Te 

(33) 

will  be  the  appropriate  reactions.  The  formation  energies 
for  a  tellurium  antisite  and  an  Hg  vacancy  from  the 
tellurium  solid  are  calculated  to  be  1.63  eV  and  1.15  eV, 
respectively.  Although  the  difference  in  the  formation 
energies  is  less  than  when  both  defects  are  referenced  to 
the  mercury  vapour  ( ~  0.5  eV  compared  with  ~  2  eV), 
the  gas  phase  entropy  factor  docs  not  enter  into  the  ratio 
of  the  defect  concentrations.  Using  the  same  estimate  of 
the  entropy  ratio,  the  defect  concentration  ratio  using 
tellurium  solid  as  the  reference  state  is 

(34) 

L'' H|J 

Thus,  near  the  inclusions  we  expect  higher  relative 
concentration  of  tellurium  antisites,  as  compared  with 
the  rest  of  the  materia!  equilibrated  with  the  Hg  vapour. 
Additionally,  the  absolute  [V„j]  defect  concentrations 
may  differ  substantially  in  the  two  regions  of  the  mater¬ 
ial.  At  present  we  expect  [V„,]  to  be  higher  in  an 
‘atmosphere’  surrounding  a  Te  inclusion.  A  better  calcu¬ 
lation  of  the  vibrational  entropy  is  needed  before  we  can 
predict  these  absolute  defect  concentrations  and  their 
spatial  extent.  Differences  in  the  defect  concentrations 
arising  from  different  equilibration  conditions  are  a 
possible  source  of  spatial  variability  of  the  HgCdTe 
material.  If  the  material  is  not  fully  annealed  to  equilib¬ 
rium,  for  example  because  of  an  abundance  of  tellurium 
precipitates,  this  history  may  affect  subsequent  process¬ 
ing. 

In  the  above  we  have  discussed  the  defect  energies  for 
HgTe  and  applied  them  directly  to  the  small-x 
Hg,-xCd,Te  system.  Because  we  are  dealing  with  the 
native  defects  of  an  alloy,  we  expect  a  number  of  com¬ 
plexities  to  affect  the  above  analysis.  First  the  variation  of 
the  defect  formation  energies  for  vacancies  is  sensitive  to 
the  alloy  environment,  in  particular  for  the  vacancies  of 
the  non-substituted  species,  such  as  tellurium  in  HgCdTe 
[39].  Even  for  vacancies  of  the  substituted  species,  we 
have  found  that  the  formation  energies  may  vary  by 
several  tenths  of  an  electron  volt.  Because  of  this  varia¬ 
tion  in  the  formation  energy,  the  fraction  of  defective  sites 
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will  vary  by  as  much  as  a  factor  of  100  from  one  class  of 
sices  to  Che  next.  Consider  various  classes  of  Hg  sices  m 
ideal  HgCdTe.  which  can  be  discinguished  to  first  order 
by  specifying  the  number  of  Hg  and  Cd  atoms  in  the 
second-neighbour  shell  (the  four  first  neighbours  are 
always  tellurium),  Hgu-^Cd^,  with  a  concentration 
given  by  [y].  The  total  vacancy  concentration  is  given  by 

[Vh.]  =  I  D  ]/’«.'  exp(  -  E^,kT)  (35) 

I 

where  £,  is  the  vacancy  formation  energy  for  the  ;th 
cluster.  The  populations  of  vacancies  in  each  class  of 
cluster.;,  can  be  expected  lo  differ  because  of  differences 
in  the  cluster  populations  and  the  formation  energies. 
Additionally,  the  defect  energy  levels  may  differ  in  the 
various  classes  of  sites,  possibly  leading  to  different 
ionization  states  for  vacancies  in  different  classes  of  sites. 
If  the  cations  in  the  alloy  are  randomly  arranged,  such 
differences  may  be  difficult  to  infer  experimentally.  If, 
however,  the  cations  are  correlated,  exhibiting  short- 
range  order,  more  complex  behaviour  may  be  present.  As 
discussed  earlier,  such  short-range  order  has  been  found 
in  HgCdTe.  In  these  cases,  the  contribution  to  the 
vacancy  densities  from  the  dominant  ^.lass  of  clusters  will 
be  increased  Because  the  studies  finding  short-range 
order  focus  on  the  tellurium-centred  five-atom  clusters  of 
the  form  Hg4.,Cd,  rather  than  on  cation-centred  clus¬ 
ters  of  the  form  Hg,2.,Cd,,  higher-level  five-atom  clus¬ 
ter-cluster  correlations  must  be  known  to  predict  the 
effects  on  the  vacancy  populations. 

Next  we  examine  the  defects  which  accommodate 
excess  Hg  in  the  solid.  The  existence  region  for  HgCdTe 
IS  always  tehurium  rich,  and  thus  the  native  defects  which 
accommodate  excess  tellurium  are  expected  to  dominate. 
For  these  equations  in  table  2,  is  given  by 

K?,.  =  Co ‘(/cD-’'^(27rm„,)-'''li^exp(A5v,.A)  (36) 
^h,t.  =  Co ‘(^T)"’(27tm„,)"^h*’exp(AS„,^./k)  (37) 
and 

f^H,.  =  Co(k70-’''(2rrm„,)-^'^h^exp(AS„,./k).  (38) 

If  we  assume  the  change  in  entropy  is  comparable  for  all 
three  defects,  we  find 

L ’TeJ 

and 

(40) 

L*TeJ 

for  T  =  500  °C  and  £««  =  •  Front  equations  (39) 
and  (40)  we  see  that  the  tellurium  vacancy  is  a  minority 
defect  species.  For  the  pressure  and  temperature  consid¬ 
ered,  the  density  of  Hg  antisites  is  predicted  to  be 
comparable  to  the  density  of  Hg  interstitials.  Because  the 
ratio  of  [Hg^,]  to  [Hg,]  is  nearly  unity,  any  errors  in  the 
calculation  of  the  activation  energy  could  push  the 
balance  toward  one  side  or  the  other.  Thus  we  must 
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depend  on  the  next  generation  of  calculations,  with  the 
ASA  removed  and  full  relaxation  included,  plus  a  quanti¬ 
tative  comparison  of  the  entropy  differences  between  the 
mercury  antisitc  and  the  tellunum  interstitital.  to  deter¬ 
mine  the  dominant  defect  in  this  class. 

As  mentioned  above,  we  have  shown  that  the  tellur¬ 
ium  vacancy  formation  energy  vanes  significantly  with 
the  alloy  environment.  Because  the  tellunum  vacancy  is 
not  expected  to  be  a  dominant  defect  in  HgCdTe,  and  the 
tellurium  diffuses  by  an  interstitial  mechanism,  we  do  not 
expect  any  measurable  manifestation  of  this  vanation 
On  the  other  hand,  the  Hg  antisite  may  be  the  dominant 
Hg-excess  defect,  and  its  formation  energy  may  vary 
significantly  with  the  alloy  environment.  We  are  cur¬ 
rently  calculating  the  magnitude  of  this  vanation. 


33.  Diffusion 

HgCdTe  exhibits  a  complex  tracer  difusion  profile,  with 
both  a  fast  and  a  slow  branch.  The  fast  branch  is 
attributed  to  a  mechanism  with  vacancy  and  interstitial 
diffusion  in  parallel  where  the  dominant  diffuser  is  deter¬ 
mined  by  the  pressure  and  temperature,  while  the  slow 
component  fits  a  mechanism  with  vacancy  and  intersti¬ 
tial  diffusion  in  series  [15].  The  activation  energies  for  the 
fast  branch  as  discussed  previously  are  1.90  eV  and 
0  67  eV  for  the  vacancy  and  interstitial  mechanisms, 
respectively.  Our  calculated  formation  energy  for  the 
mercury  interstitials  are  0.84  and  0.98  eV  for  the  anion- 
and  cation-surrounded  tetrahedral  interstitial  sites,  re¬ 
spectively.  and  the  experimental  formation  energy  for  the 
mercury  vacancy  is  2.01  eV.  Comparing  these  energies 
with  the  experimental  activation  energies  we  find  close 
agreement,  indicating  that  the  migration  energy  contri¬ 
bution  to  the  diffusion  activation  energies  are  small  for 
both  interstitials  and  vacancies. 

In  the  vacancy  diffusion  case  we  are  comparing  two 
experimental  numbers  for  the  diffusion  and  formation 
energies  Because  the  diffusion  energy  (1.94eV)  is  the 
sum  of  the  formation  and  migration  energy,  the  fact  that 
the  vacancy  formation  energy  (2.01  eV)  is  larger  but 
should  be  smaller  is  an  indication  that  something  is 
amiss,  and  the  experiments  should  be  repeated  with  the 
goal  of  attaining  higher  accuracy. 

There  is  a  similar  discrepancy  in  the  interstitial  diffu¬ 
sion  case,  but  now  we  are  companng  the  experimental 
diffusion  activation  energy  0.67  eV  with  a  theoretical 
formation  energy  0.89  eV.  We  know  that  refinements  to 
the  theory  will  lower  the  predicted  value.  These  refine¬ 
ments  need  to  be  done  before  more  definitive  conclusions 
can  be  made. 

The  difference  between  the  two  numbers  obtained  for 
cation-  and  anion-surrounded  interstitital  sites  (0.84  eV 
and  0.98  eV)  sets  a  lower  bound  on  the  interstitial 
migration  energy.  Most  of  the  interstitial  Hg  will  sit  on 
the  lower-energy  anion-surrounded  site,  and  migrate 
through  the  intermediate-energy  cation-surrounded 
sites.  Examination  of  the  lattice  arrangement  between 
these  sites  leads  us  to  believe  that  the  potential  profile  is 
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unlikely  to  have  a  large  barrier  between  the  two  classes  of 
site,  and.  as  a  consequence,  the  interstitial  migration 
barrier  should  be  quite  low 

In  a  recent  experiment  on  mercury  diffusion  in  lon- 
implantation-damaged  HgCdTe.  an  activation  energy  of 
several  tenths  of  an  electron  volt  was  measured  [40],  The 
disparate  results  can  be  interpreted  as  a  measure  of  only 
the  defect  migration  contribution  to  the  diffusion  activa¬ 
tion  energy,  since  defects  in  excess  of  the  equilbinum 
concentration  were  probably  formed  during  implanta¬ 
tion.  It  IS  not  evident  that  the  measured  activation  energy 
corresponds  to  the  vacancy  or  the  interstitial  mechanism. 
The  conclusion  that  the  diffusion  activation  energies  are 
largely  defect  formation  energies,  with  the  migration 
energies  being  much  smaller,  is  in  agreerru-nt  with  the 
above  interpretation  of  the  Richter  and  Kalish  [40] 
expenment. 


4.  Conclusions 

We  have  incorporated  our  calculated  defect  energies  into 
the  mass  action  equations  for  the  neutral  defects  in 
HgCdTe.  In  agreement  with  expenment,  we  find  the 
mercury  vacancy  to  be  the  dominant  native  defect  in 
tellurium-rich  material.  We  also  find  the  mercury  antisite 
and  interstitial  defect  densities  to  be  comparable,  al¬ 
though  a  better  calculation  of  the  vibrational  entropy  is 
needed  to  confirm  this  result.  Comparing  the  defect 
formation  energies  of  Vydyanath  [13,  14]  and  our  theory 
to  the  diffusion  measurements  by  Chen  [16]  and  by  Tang 
and  Stevenson  [15],  we  find  agreement  with  their  diffu¬ 
sion  activation  energies  for  both  the  vacancy  and  the 
interstitial  mechanisms,  if  we  assume  that  the  migration 
energy  is  small  in  both  cases.  Small  migration  energies 
are  consistent  with  the  measurement  of  Richter  and 
Kalish  [40]  and  our  theory.  Further  work  is  in  progress 
to  incorporate  the  defect  charge  states  into  the  calcula¬ 
tion.  and  also  to  calculate  the  fully  relaxed  defect  energies 
with  the  full  potential  lmto. 

If  the  cation  migration  energies  are  as  small  as  we  are 
suggesting,  then  some  cation  motion  will  be  present  even 
at  room  tempierature.  This  implies  that  the  crystal  can 
equilibrate  to  its  room  temperature  order-disorder  cor¬ 
relation  state.  Most  semiconductor  alloys  at  room  tem¬ 
perature  are  in  metastable  correlation  states  frozen  to  the 
temperature  where  diffusion  stopped  as  they  were  cooled. 
How  this  affects  devices  remains  to  be  determined. 

We  predict  that  near  a  Te  inclusion  there  will  be  an 
‘atmosphere’  of  defects  differing  from  the  one  the  bulk 
solid  has  in  equilibrium  with  a  Hg  vapour.  In  this 
atmosphere  the  ratio  of  the  concentrations  of  the  tellur¬ 
ium  antisites  to  Hg  vacancies  will  be  increased.  The 
tellurium  antisite  concentration  may  still  be  too  small  to 
affect  any  device  properties,  but  if  the  Hg  vacancy 
concentration  is  greater  near  the  inclusims  than  in  the 
bulk,  this  will  be  a  source  of  spatial  variability  in  the 
material.  This  is  a  potential  source  of  the  worms  in  lpe 
material.  We  are  cautious  about  this  suggestion  because 


TEM  studies  of  annealed  lpe  material  indicate  that  theie 
are  few,  if  any.  Te  inclusions  remaining  [7] 

First-principles  theories  are  now  fast  enough  and 
accurate  enough  for  practical  use  in  process  and  perfor¬ 
mance  modelling  An  effort  to  capture  the  results  of  many 
experiments  into  consistent  models,  and  thereby  increase 
their  reliability,  has.  as  we  have  tned  to  demonstrate  in 
this  paper,  become  a  realistic  goal 
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A  tight-binding  model  is  used  to  calculate  the  formation  energies,  bond  lengths,  and  bulk  moduli 
of  a  number  of  ordered  III-V  and  II-VI  semiconductor  alloys.  The  parameters  in  the  model  are  ad¬ 
justed  so  that  the  bond  lengths,  cohesive  energies,  bulk  moduli,  and  shear  elastic  constants  for  the 
constituent  compounds  are  described  correctly.  The  model  is  then  applied  to  alloys  without  further 
adjustment.  Based  on  the  calculated  excess  energies,  we  conclude  that  none  of  the  ordered  alloys 
found  experimentally  is  in  its  stable  bulk  equilibrium  state  at  the  growth  temperatures.  Although 
the  alloy  excess  energies  can  be  negative,  if  the  reference  constituent  compounds  are  constrained  to 
match  the  substrate  lattice  used  in  epitaxial  growth,  their  magnitudes  are  not  large  enough  to  ac¬ 
count  for  the  observed  ordering.  A  possible  explanation  of  the  observed  states  in  terms  of  a  barrier 
that  prevents  the  metastable  ordered  alloy  from  decomposing  into  separate  phases  is  presented. 
However,  this  explanation  only  applies  to  alloys  with  lattice-mismatched  constituents.  Detailed  re¬ 
sults  on  the  bond  lengths  and  bulk  moduli  are  also  discussed. 


1.  INTRODUCTION 

The  bulk  semiconductor  alloys  were  long 

thought  to  be  ideal  pseudobinary  compounds,  in  which 
the  C  atoms  sit  in  a  fee  sublattice  while  the  substituting 
atoms  A  and  B  randomly  occupy  the  sites  of  the  other  fee 
sublattice.  However,  several  recent  findings  in  the  last 
few  years  revealed  quite  a  different  picture.  First,  extend¬ 
ed  x-ray-absorption  fine-structure  (EXAFS)  experi¬ 
ments'"  clearly  showed  a  bimodal  distribution  for  the 
nearest-neighbor  bond  lengths  in  these  alloys,  which  im¬ 
plies  that  the  equilibrium  atomic  positions  are  not  the 
virtual-crystal  sites  of  the  zinc-blende  crystal.  Recent  ex¬ 
periments^'*  and  theories’’^'  also  indicated  that  the  ar¬ 
rangement  of  the  alloying  atoms  in  these  systems  is  not 
completely  random.  Most  surprising  of  all,  however,  has 
been  the  discovery  of  long-range  ordering  (LRO)  in  these 
alloys  grown  epitaxially.  Essentially,  all  the  III-V  al¬ 
loys  grown  by  molecular-beam  epitaxy  (MBE)  or  metal 
organic  chemical  vapor  deposition  (MOCVD)  under 
some  special  growth  conditions  are  found  to  be  ordered. 
While  a  great  majority  of  these  ordered  alloys  form  the 
ABC2  compounds  in  one  or  more  of  the  following  three 
crystal  structures:  the  CuPt  (CP),  CuAu  I  (CA),  and 
chalcopyrite  (CH),  a  few  alloys  are  ordered  in  the  form  of 
AyBC^  with  the  famatinite  or  luzonite  structure.  These 
crystal  structures  have  been  well  described  by  Wei 
et  al.^°  Table  I  is  a  partial  list  of  the  LRO  alloys  that 
have  been  grown,  together  with  the  growth  conditions 
and  ordered  structure  found.  For  later  emphasis,  we  note 
that  the  substrate  temperatures  for  the  ordering  to  occur 
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range  from  4(X)'’C  to  800  °C,  and  the  ordering  directions 
are  not  necessarily  the  same  as  the  growth  direction. 

Finding  LRO  is  surprising,  because  it  is  at  variance 
with  the  well-established  conventional  picture  for  the 
bulk  semiconductor  alloys  having  simple  phase  diagrams 
with  miscibility  gaps"'”^*  driven  by  strain  energy.  The 
question  is,  arc  these  ordered  alloys  thermodynamic  equi¬ 
librium  states?  This  question  can  be  answered  if  accurate 
values  for  the  alloy  excess  energies  can  be  determined. 
To  be  specific,  we  shall  only  consider  three  important 
structures,  CP,  CA,  and  CH,  for  the  ABC2  alloys,  and 
define  an  excess  energy  A£  as 

^E=E{ABC2)-[E{AC)+E(BC)]  ,  (1) 

where  E  ( ABC2 )  is  the  energy  per  molecule,  or  per  four 
atoms,  in  the  alloy  ABC2,  and  E(AC)  and  E{BC)  are 
the  energies  per  pair  of  atoms  in  the  AC  and  BC  zinc- 
blende  compounds,  respectively.  If  these  ordered  alloys 
are  in  their  thermodynamically  stable  states  at  the 
growth  temperature,  A£  has  to  be  negative  and  must 
have  a  magnitude  considerably  larger  than  200  meV  on 
the  present  scale.  This  would  be  in  contradiction  with 
the  positive  values  of  A£  previously  reported  for  the  bulk 
semiconductor  alloys. However,  the  ordered 
semiconductor  alloys  found  from  the  epitaxial  growth 
may  be  in  a  constrained  equilibrium  state,  where  the  con¬ 
straint  is  imposed  by  the  substrate  strain.  We  need  to 
know  the  energetics  of  the  various  states  involved  besides 
the  bulk  access  energy  before  we  can  start  understanding 
the  ordering  and  stability  of  these  alloys.  The  calculation 
of  some  of  these  energies  along  with  an  accuracy  analysis 
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TABLE  1.  A  list  of  the  ordered  Ill-V  semiconductor  alloys  identified  experimentally. 


Alloys 

Structure 

Growth 

method 

Substrate 

Temperature  I'C) 

Reference 

AlGaAs; 

CA 

MOeVD  and  MBE 

GaAs(lOO)  and  GaAsdlOl 

600-800 

Ref.  7 

AllnP, 

CP 

MOeVD 

GaAs(OOl) 

650-700 

Ref.  13 

AlInAs_, 

CP 

MOeVD 

InPtOOl) 

600 

Ref  9 

GalnP; 

CP 

MOeVD 

GaAs(OOl) 

650 

Ref  14 

CP 

MOeVD 

GaAsiOOl) 

640 

Ref  15 

CP 

MOeVD 

GaAs(OOl) 

650-700 

Ref  13 

CP 

MOeVD 

GaAsiOOl) 

600-630 

Ref  16 

CP 

MOeVD 

GaAsiOOl) 

600-700 

Ref  17 

GalnAs; 

famalinite 

LPE 

InPlllO) 

630 

Ref  8 

luzonite 

MBE 

InPiOOl) 

400 

Ref  10 

CA 

MBE 

InPlllO) 

500 

Ref  11 

CP 

VLE 

InPlOOl) 

650-660 

Ref  12 

GaiAsSb 

CA 

MOeVD 

InPIlOO) 

550-680 

Ref  18 

CH 

MOeVD 

InPilOO) 

600 

Ref  18 

CP 

MBE 

540 

Ref  19 

is  the  main  purpose  of  this  paper. 

The  e.xcess  energies  for  a  number  of  ordered  semicon¬ 
ductor  alloys  have  been  calculated  from  local-density- 
functional  (LDF)  theory  using  various  band-structure 
methods.*  Although  the  LDF  error  for  the 

cohesive  energy  of  a  III-V  compound  is  typically  several 
tenths  of  one  eV  per  pair  of  atoms,  it  is  generally  believed 
that  the  excess  energy  based  on  the  same  technique  is  ac¬ 
curate  to  several  meV,  because  the  errors  in  LDF  cancel 
in  Eq.  (1).  However,  the  ordered  alloys  that  we  are  con¬ 
sidering  are  open  structures  containing  several  atoms  per 
unit  cell.  The  atomic  positions  in  these  alloys  are  usually 
distorted  away  from  the  regular  zinc-blende  sites.  Only 
the  most  sophisticated  band-structure  theories  which  are 
capable  of  treating  shear  distortion,  such  as  the  full- 
potential  linear  combination  of  muffin-tin  orbitals  (FP- 
LMTO),^^  full  potential  augmented  plane  waves  (FP- 
APW),^"*  and  the  fully  converged  plane-wave  pseudopo¬ 
tential  method,^*  can  be  expected  to  yield  precise  results 
within  LDF.  Even  with  present-day  computers,  it  is  still 
too  expensive  to  use  these  methods  to  perform  calcula¬ 
tions  over  a  wide  range  of  semiconductor  alloys.  On  the 
other  hand,  although  the  valence-force-field  (VFF)  mod¬ 
el^*' is  simple  and  is  effective  in  treating  the  strain  ener¬ 
gy,  it  cannot  account  for  the  chemical  energy.^*  These 
considerations  have  motivated  us  to  use  the  empirical 
tight-binding  (ETB)  method.  The  ETB  not  only  can  treat 
both  the  strain  and  chemical  energies  but  also  allows  for 
precise  and  systematic  computations.  To  eliminate  the 
propagation  of  errors  from  constituent  compounds  to  al¬ 
loys,  the  parameters  in  ETB  are  adjusted  to  produce  the 
experimental  values  for  the  cohesive  energy,  bond  length 
d,  bulk  modulus  B,  and  the  shear  elastic  constant 
C||  —  C,2  for  each  constituent  crystal.  These  parameters 
are  then  used  in  the  alloy  calculation  without  further  ad¬ 


justment.  Our  method  thus  corresponds  to  an  interpola¬ 
tion  scheme  for  the  alloys  between  the  constituent  com¬ 
pounds.  Such  an  approach  is  particularly  appropriate  for 
the  present  study,  because  the  alloys  and  the  constituent 
crystals  have  very  similar  structures  and  local  bonding. 

The  rest  of  this  paper  is  arranged  as  follows:  Section  II 
describes  the  ETB  model,  the  way  the  parameters  are 
determined,  and  the  results  for  the  structural  properties 
of  the  constituent  crystals.  Section  III  briefly  describes 
the  structural  parameters  and  the  energy-minimization 
procedure  for  the  three  alloy  structures  in  both  their  bulk 
equilibrium  states  and  in  states  constrained  to  match  sub¬ 
strates.  The  calculated  excess  energies  are  summarized  in 
Sec.  IV  and  are  compared  with  those  from  LDF  and 
VFF.  To  provide  more  detailed  structural  information, 
the  calculated  bond  lengths  and  bulk  moduli  of  the  alloys 
are  also  presented.  The  final  section.  Sec.  V,  contains  a 
summary  and  discussion. 


II.  TIGHT-BINDING  MODEL 

The  tight-binding  (TB)  model  that  we  are  going  to  use 
is  very  similar  to  that  used  by  Chadi^*  and  Harrison. 

The  total  energy  of  a  semiconductor  crystal  is  assumed  to 
be  the  sum  of  the  electron  energies  E^,(k)  in  the  valence 
bands  and  the  pair  repulsive  energies  u,j  between  the 
nearest-neighbor  atoms: 

=  +  ■  <21 
tt  k  i  >j 

Furthermore,  the  band  energies  are  calculated  using  a 
minimum-basis  TB  Hamiltonian  which  includes  one  s 
and  three  p  orbitals  per  atom.  The  interaction  parame¬ 
ters  needed  from  the  Hamiltonian  are  the  term  values  e. 
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and  Ep  of  each  atom  and  the  nearest-neighbor  two-center 
interactions  and  For  a  given  crys¬ 

tal  structure,  the  Hamiltonian  //(k)  associated  with  a 
given  wave  vector  k  within  the  first  Brillouin  zone  has  a 
dimension  of  Am,  where  m  is  the  number  of  atoms  per 
unit  cell.  The  diagonal  part  of  //(k)  consists  of  the  term 
values,  and  the  off-diagonal  elements  are  computed  as 

//yy.(k)=  .  (3) 

d 

where  the  sum  runs  over  the  first-neighbor  bond  displace¬ 
ments  dyy  that  point  from  the  orbitals  denoted  y  to  y\ 
The  y  stands  for  the  s,  p^,Py,  or  p^  orbital  of  a  particular 
atom  in  a  given  unit  cell.  The  matrix  elements  h  are  re¬ 
lated  to  the  two-center  interactions  by  the  Slater-Koster 
relations,^' 


Ksjct  • 

(4) 

(5) 

f'xx=ci]yppo  +  ''-a])Vpp„  , 

(6) 

ftxy  =  aia2{Vpp„-Vpp„)  , 

(7) 

where  a^^x^/d  are  the  direction  cosines  of  d,  while  all 
the  I^s  depend  only  on  the  length  d. 

The  first  task  is  to  determine  the  forms  for  the  interac¬ 
tions  and  the  repulsive  energy  u  from  the  constitu¬ 
ent  compounds.  Since  the  strain  energy  plays  a  very  im¬ 
portant  role  in  the  alloy  formation  energy,  we  shall  make 
sure  that  our  model  produces  the  correct  elastic  con¬ 
stants.  To  keep  the  model  close  to  Harrison’s^^'^  origi¬ 
nal  form,  but  to  free  it  from  his  rigid  \/d^  and  l/d*  scal¬ 
ing  rules  for  and  u,  respectively,  we  assume  the  fol¬ 
lowing  forms; 

(8) 


and 

u{d)=^UoidQ/d)'”  ,  (9) 

where  the  superscript  and  subscript  0  indicate  the  values 
evaluated  at  the  equilibrium  bond  length  dQ.  The  values 
of  taken  to  be  Harrison’s  universal  forms  scaled 

by  a  factor  /, 

dO) 

Note  that  Harrison’s*®  universal  two-center  interactions 
take  the  form 

where  m  is  the  free-electron  mass,  fi  is  Planck’s  constant, 
and  the  tj’s  take  the  following  values:  — 1.32, 

1.42, 77pp„  =  2.22,  and  17^^^=  -0.63. 

Thus  there  are  four  adjustable  parameters  for  each  sys¬ 
tem:  the  scaling  parameter  /,  the  powers  n  and  m,  and 
the  value  Uq.  These  parameters  are  determined  by  re¬ 
quiring  that  the  model  produce  the  correct  experimental 
values  for  the  bond  energy  Fbond*  shear 

elastic  constant  C,  1  ^12-  Since  Cn— C,2  is  governed 
only  by  in  the  present  model,  it  alone  determines  the 
scaling  factor  /.  Then  the  bond  energy  F^ond  **  *0 

determine  Uq.  The  requirement  that  the  first  derivative 
of  Ej  be  zero  at  d^  then  determines  the  ratio  of  the 
powers  n  /m,  which  couples  with  the  equation  for  the 
bulk  modulus  to  yield  the  values  for  n  and  m.  One  can 
then  use  these  sets  of  parameters  to  calculate  other  quan¬ 
tities  that  are  not  employed  in  the  fitting,  e.g.,  another 
shear  elastic  constant  C44,  the  Kleinman  internal- 
displacement  parameter**  and  the  optical-phonon  fre¬ 
quencies  0)  at  the  zone  center,  to  check  the  validity  of  the 
model. 

In  the  actual  calculations  we  used  the  term  values  tab¬ 
ulated  by  Chen  and  Sher.*^  Table  II  lists  the  experimen- 


TABLE  II.  Values  of  bond  length  d,  bond  energy  £bond>  bulk  modulus  B,  and  shear  coefficient 
C  =  C||  —  C|2  used  to  determine  the  parameters  in  Table  III.  Also  listed  are  the  experimental  values  of 
C44  and  the  TO-phonon  mode  cu  at  F  to  be  compared  with  the  calculations.  All  the  elastic  constants 
are  in  units  of  10"  dyn/cm%  d  in  A,  Ebond  in  cV,  and  tJjo  is  given  in  terms  of  wave  numbers  in  cm“'. 


The  sources  of  these  values  are  discussed  in  Ref.  47.  Also  listed  are  the  force  constants  a  and  p  (N/m) 
for  the  valence-force-field  model  in  Eg.  (13). _ 


d 

r 

^'bond 

B 

C 

c„ 

0) 

a 

P 

AlP 

2.367 

-2.13 

8.600 

6.900 

6.150 

440 

43.867 

9.429 

AlAs 

2.451 

-1.89 

7.727 

7  160 

5.420 

361 

40.360 

10.132 

AlSb 

2.656 

-1.76 

5.817 

4.428 

4.076 

366 

33.417 

6.790 

GaP 

2.360 

-1.78 

9.143 

7.870 

7.143 

367 

46.257 

10.723 

GaAs 

2.448 

-1.63 

7.690 

6.630 

6.040 

269 

40.351 

9.371 

GaSb 

2.640 

-1.48 

5.792 

4.946 

4.440 

231 

32.800 

7.539 

InP 

2.541 

-1.74 

7.247 

4.460 

4.600 

304 

40.346 

6,543 

InAs 

l.tll 

-1.55 

5.794 

3.803 

3.959 

219 

33.165 

5.757 

InSb 

2.805 

-1.40 

4.831 

3.130 

3.132 

185 

29.605 

5.069 

ZnTe 

2.637 

-1.20 

5.090 

3.060 

3.120 

177 

29.445 

4.659 

CdTe 

2.806 

-1.10 

4.210 

1.680 

2.040 

141 

26.374 

2.722 

HgTe 

2.798 

-0.81 

4.759 

1.817 

2.259 

116 

29.TJZ 

2.935 

124 
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tal  values^*'*  for  the  fbond’ 

used  to  fit  the  parameters,  and  the  values  of  C44  and 
transverse  phonon  frequency  oj  used  for  the  consistency 
check. 

The  elastic  constants  can  be  calculated  directly  from 
perturbation  theory.  First  the  Hamiltonian  //(k)  is  ex¬ 
panded  in  powers  of  the  infinitesimal  strain  parameters  e 
up  to  second  order: 


H  =Ho  +  H^e+\H2<^\  (11) 

where  //q  is  the  strain-free  Hamiltonian  and  //,  and  Hi 
are,  respectively,  the  first  and  second  derivatives  with 
respect  to  e  evaluated  at  e  =  0.  The  band-energy  contri¬ 
bution  to  the  strain  coefficient  then  comes  from  the 
second  derivative  of  with  respect  to  #■,  denoted  by 


^'^BS 

a,’ 


22  <^'k!«;l'-’.k) 

1-  k 

ftf  £„(k)-e,(k) 


(12) 


which  is  consistently  larger  than  the  n  =  2  used  in 
Harrison's  universal  TB  parameters.  For  the  repulsive 
pair  energy  u  =  Uoldg/tf the  power  m  ranges  from  5.4 
to  7.1.  The  ratio  m/n  falls  in  the  range  from  1.5  to  1.9, 
which  is  smaller  than  the  m/n  =  2  used  by  Harrison.^** 
The  calculated  values  of  C44  for  most  systems  agree  with 
experiment  to  10%  or  better.  The  calculated  TO-phonon 
modes  at  F  for  most  III-V  systems  also  agree  with  experi¬ 
ments  to  10%  or  better.  The  discrepancies  for  the  Il-Vl 
systems  are  larger  (about  15%).  Reliable  results  for  C 
from  both  experiments  and  first-principles  calculations 
are  only  available  for  a  limited  number  of  semiconduc¬ 
tors.  The  calculated  and  oj  from  the  TB  model 

agrees  very  well  with  those  results,  as  shown  in  Table  IV. 

The  results  in  Tables  III  and  IV  are  based  on  a  particu¬ 
lar  set  of  term  values  and  TB  parameters.  It  is  useful  to 
examine  how  the  predictions  are  influenced  by  these  pa¬ 
rameters  and  the  fitting  procedure.  Table  V  shows  a  re¬ 
sult  based  on  Chadi's  procedure^*  **  in  which  the  TB  ma¬ 
trix  elements  are  scaled  as  \/d'.  and  the  repulsive 
pair  energy  is  taken  to  be 


where  £^(k)  and  ic,k)  are,  respectively,  the  eigenenergies 
and  eigenvectors  of  Hq  for  the  conduction  bands.  Simi¬ 
larly,  i',k  indexes  the  valence  bands.  Note  that  the  inter¬ 
valence-band  contributions  in  the  second-order  perturba¬ 
tion  sum  cancel  exactly  and  so  they  are  not  needed  in  Eq. 
(12).  The  matrix  elements  of  and  //.  needed  here  can 
be  expressed  from  Eqs.  (3)-(7)  in  terms  of  the  first  and  the 
second  strain  derivatives  of  the  two-center  interactions 
and  the  direction  cosines  a,. 

Table  III  shows  the  results  for  /,  n,  m,  and  Uy  obtained 
for  the  constituent  compounds,  and  the  corresponding 
values  of  C44,  C,  and  w  calculated  as  a  consistency  check. 
The  scaling  factor  /  ranges  from  1.1  to  1.5  and  tends  to 
decrease  with  an  increase  in  polarity.  In  the  power-law 
dependence  (<fy/J)",  n  ranges  from  3.3  to  4.2, 


The  parameter  Uy  is  set  to  produce  the  correct  bond  ener¬ 
gy,  u  I  is  determined  by  requiring  the  correct  equilibrium 
bond  lengtn,  and  U;  is  fixed  by  the  bulk  modulus.  Two 
sets  of  TB  parameters  are  tabulated  for  each  system:  One 
of  them  is  the  set  used  by  Chadi^*  and  the  other  (labeled 
present  work)  is  the  set  obtained  by  multiplying 
Harrison  s  Vaa'  by  the  scaling  factor  /  listed  in  Table  III. 
For  convenient  comparison,  the  zero  of  the  term  values  is 
set  equal  to  the  anion  s  energy.  Despite  considerable 
differences  in  these  two  sets  of  TB  parameters,  the  results 
of  the  predictions  from  the  two  sets  are  very  similar  and 
also  very  similar  to  those  predicted  from  the  other  pro¬ 
cedure  given  in  Table  IV.  The  only  noticeable  difference 


TABLE  Ill.  The  results  for  the  parameters  /,  n,  m,  and  u„  obtained  from  the  fitting  of  the  bond  en¬ 
ergy,  bond  length,  bulk  modulus,  and  shear  coefficient  C,,— C,;  of  Table  II  using  the  full  band- 
structure  calculation.  Also  listed  are  the  calculated  C44,  internal-displacement  parameter  f,  and  the 
TO-phonon  mode  ai  at  P.  All  the  eiasric  constants  are  in  units  of  10"  dyn/cm%  Uq  is  in  eV,  and  oj  are 
given  in  terms  of  wave  numbers  in  ctn  _ 


f  n  m 


AlP 

1.294 

3.530 

5.598 

AlAs 

1.464 

3.524 

5.430 

AlSb 

1.337 

3.268 

5.668 

GaP 

1.395 

3.705 

5.683 

GaAs 

1.397 

3.633 

5.716 

GaSb 

1.431 

3.471 

5.717 

InP 

1.323 

4.240 

6.633 

InAs 

1.300 

3.997 

6.427 

InSb 

1.353 

3.773 

6.399 

ZnTe 

1.284 

3.306 

5.828 

CdTe 

1.171 

3.656 

6.761 

HgTe 

1.173 

3.760 

7.074 

Un 

C44 

C 

(0 

6.435 

5.827 

0.516 

447 

7.089 

5.598 

0.459 

384 

4.838 

3.944 

0.564 

354 

7.285 

6.857 

0.501 

382 

6.530 

5.791 

0.500 

292 

5.519 

4.515 

0.536 

256 

5.603 

4.260 

0.584 

304 

4.962 

3.564 

0.552 

220 

4.350 

3.092 

0.602 

200 

4.285 

3.092 

3.080 


125 


2.813 
1.70 1 
2.040 


0.590 

0.694 

0.716 


205 

156 

152 
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between  the  predictions  in  Table  IV  and  Table  V  is  that 
in  the  latter  the  phonon  frequencies  are  slightly  larger 
and  C44  are  slightly  smaller.  These  comparisons  show 
that  the  model  with  parameters  given  in  Table  II  not  only 
contains  the  correct  structure  properties  inserted  through 
the  fitting  procedure,  but  also  predicts  other  mechanical 
properties  with  reasonable  accuracy.  This  model  should 
serve  as  a  good  basis  for  alloy  calculations. 

III.  ALLOY  CALCULATION 

A.  CuAu  I  structure  (CA) 

The  ABCi  semiconductor  alloy  in  the  CA  structure 
forms  a  layer  structure  ACBCACBC  ■  ■  ■  along  the 
<001 )  direction.  The  basic  lattice  vectors  can  be  chosen 
as  a,  — (|,i,0)a,  j,  —  y,0)a,  and  a3  =  (0,0,/S)a, 

where  P  is  the  c/a  ratio  with  a  and  c  being  the  lattice  pa¬ 
rameters.  Note  that  the  ideal  /3  value  for  a  zinc-blende 
structure  is  1.  There  are  four  atoms  per  unit  cell:  one  A 
atom  at  (0,0,0),  one  B  atom  at  (y,0,^/2)a,  and  two  C 
atoms  at  (l,l,^-l-S)a/4  and  (3,1, 3/3— S)a/4,  where  S  is 


TABLE  IV.  Comparison  between  calculated  and  experimen¬ 
tal  lattice  constant  a,  elastic  constants  B,  C||— C,:  and  C*,. 
Kleinman  (Ref.  42)  internal-displacement  parameter  and  the 
TO-phonon  frequency  eo  in  wave  numbers  cm'  Also  listed  are 
C£'  which  correspond  to  the  value  without  the  internal  distor¬ 
tion.  The  FP-LMTO  and  PP-PW  are  the  ab  initio  theories,  and 
TB  is  our  tight-binding  method  discussed  in  the  text.  All  elastic 
constants  are  in  units  of  10"  dyn/cm^  Experimental  values  are 
those  listed  in  Table  H. 


Expt. 

FP-LMTO* 

PP-PW" 

TB 

Si 

a 

5.431 

5.41 

5.45 

5.431 

B 

9.923 

9.9 

9.3 

9.923 

C|l 

10.274 

10.2 

9.8 

10.274 

C*. 

8.036 

8.3 

8.5 

8.013 

/^(0> 

'-44 

11.1 

11.30 

0.54' 

0.51 

0.53 

0.51 

(O 

523 

518 

521 

572 

Ge 

a 

5.65 

5.59 

5.65 

B 

7.653 

7.2 

7.653 

^11  “^-12 

8.189 

8.5 

8.189 

c« 

6.816 

6.3 

6.84 

/^(O) 

*-44 

7.7 

9.46 

0.44 

0.49 

0) 

303 

302 

342 

GaAs 

a 

5.642 

5.55 

5.642 

B 

7.69 

7.3 

7.69 

Cll  — C,2 

6.63 

7.0 

6.63 

C44 

6.04 

6.2 

5.791 

/•'(Oi 

^44 

7.5 

7.83 

0.48 

0.50 

(i) 

273 

268 

292 

•Reference  33. 
“"Reference  49. 
'Reference  50. 


the  internal  distortion  parameter  for  the  C  atoms.  There 
are  only  two  different  nearest-neighbor  bond  lengths  in 
the  alloy: 

d,«c  =  la/4)[2  +  (/3+8)‘]'''^ 

and 

For  a  set  of  values  for  a,  /S,  and  6,  the  total  energy  is  cal¬ 
culated  following  the  general  description  in  Sec.  II.  The 
interactions  and  the  repulsive  energy  are  scaled  by 
the  bond  lengths  according  to  Eqs.  (8)  and  (9),  respective¬ 
ly.  The  Hamiltonian  Hik)  is  now  a  16X  16  matrix.  The 
total  energy  is  then  minimized  by  varying  the  three  pa¬ 
rameters  a,  0,  and  S.  If  the  alloy  is  constrained  to  match 
a  (001)  substrate,  the  lattice  constant  a  is  fixed  by  the  sub¬ 
strate,  and  the  total  energy  is  minimized  with  respect  to  0 
and  5. 

B.  Chalcopyrite  structure  (CH) 

The  ABC2  semiconductor  alloy  in  the  CH 
structure  forms  a  superlayer  structure 
ACACBCBCACACBCBC  ■  ■  ■  along  the  (012)  direc¬ 
tion.  The  basic  lattice  vectors  can  be  chosen 
as  a,  =  (l,l,-2i8)o/2,  a2  =  (  -  1,  l,2^)a /2,  and 
aj=(  1,  —  l,2/3)a/2,  where  0  again  is  the  c/a  ratio,  with 
an  ideal  value  of  1.  There  are  now  eight  atoms  per  unit 
cell:  two  A  atoms  at  (0,0,0)  and  (O,],0)a/2,  two  B  atoms 
at  il,O,0)a/2  and  (l,l,0)fl/2,  and  four  C  atoms  at 
(l+8,l,/S)a/4,  (l,3  +  6,3/5)fl/4,  (3,1 -6,3^)fl/4,  and 

(3— 6,3,/3)fl/4,  where  8  is  the  internal  distortion  parame¬ 
ter  for  the  C  atoms.  Again  there  are  also  only  two 
different  nearest-neighbor  bond  lengths  in  the  alloy: 

£/^c  =  (fl/4)[l  +  (l+8)^  +  /3^]'^^ 

and 

dac=(o/4)ll  +  (l-8)*  +  /3-]''^  . 

A  similar  procedure  can  now  be  carried  out  to  minimize 
the  total  energy  with  respect  to  the  three  parameters  a,  0, 
and  6.  However,  the  Hamiltonian  Hik)  is  now  32X32. 
For  the  case  in  which  the  lattice  is  matched  to  the  (001) 
substrate,  we  again  are  left  with  two  parameters  0  and  5 
to  vary  for  the  energy  minimization. 

C.  CuPt  structure  (CP) 

In  the  CP  structure,  the  alloy  forms  a  (111)  superlat¬ 
tice  ACBCACBC  •  ■  •  .  Because  of  a  lack  of  reflection 
symmetry  about  any  of  these  planes,  the  B  layer  need  not 
be  located  exactly  at  the  middle  position  between  the  two 
successive  A  layers.  Also  the  distance  between  two 
closest  atoms  from  two  different  A  layers  may  not  need 
to  correlate  with  that  between  two  A  atoms  on  the  same 
plane.  Thus  there  are  a  total  of  five  independent  parame¬ 
ters  required  to  describe  the  crystal  structure:  the  lateral 
lattice  constant  a  for  the  layers,  the  spacing  D  between 
two  successive  A  layers,  and  the  three  spacing  parame¬ 
ters  for  the  three  layers  (one  B  and  two  C)  inside  D. 
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TABLE  V.  Comparison  between  the  two  different  sets  of  TB  parameters  described  in  the  text;  the  resultant  coefficients  uo>  u,,  and 
U;  of  and  the  predicted  elastic  constants,  Kleinman  internal-displacement  parameters  a.  and  phonon  frequency  (u. 
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'  ipa 

y 
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Si 

Chadi 

0.0 

7.20 

0.0 

7.20 
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Present 
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Uo 

u> 

C 
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Si 
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23.90 
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7.89 
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0.49 
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Present 
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work 
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ef 
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Chadi 

0.0 

9.64 

5.12 
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2.40 
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Present 
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2.52 

2.52 

3.94 

-1.12 

work 

Uil 

U  2 

C 
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^  44 

0) 

GaAs 

Chadi 

5.12 

-7.12 

18.22 

6.36 

5.60 

8.77 

0.54 

339 

Present 

6.53 

-8.39 

19.90 

6.63 

5.70 

8.53 

0.54 

322 

work 

There  are,  however,  only  four  atoms  per  unit  cell,  each 
coming  from  a  different  layer,  so  the  Hik)  matrix  is 
16X16.  There  are  also  four  different  bond  lengths  in  the 
crystal,  two  for  the  AC  and  two  for  the  BC  bonds.  If  the 
alloy  lattice  is  matched  to  a  (111)  substrate,  the  lateral 
lattice  parameter  is  fixed,  but  we  still  have  four  parame¬ 
ters  to  vary  for  energy  minimization.  However,  if  the  al¬ 
loy  is  matched  to  a  (001)  substrate,  we  shall  assume  that 
all  the  alloying  atoms  A  and  B  are  locked  into  the  fee  lat¬ 
tice  points  of  the  substrate,  and  we  are  left  with  only  two 
free  parameters  which  describe  the  relaxation  of  the  two 
nonequivalent  C  layers. 

It  is  useful  to  comment  on  the  Brillouin-zone  (BZ)  in¬ 
tegration  needed  for  the  calculation  of  the  electronic  en¬ 
ergy.  Since  our  calculation  involves  relatively  small  and 
easily  handled  matrices,  we  are  able  to  sample  over  a 
large  number  of  k  points.  We  found  that  a  uniform  grid 
of  1000  k  points  inside  the  BZ  always  guarantees  a  con¬ 
vergence  of  the  total  energy  pair  of  atom  to  within  10”^ 
meV  and  the  elastic  constants  to  an  accuracy  in  the  third 
digit.  We  found  that  total  energies  calculated  using  two 
special  k  points^'  are  about  10  meV  per  atom  pair  higher 
than  the  converged  value.  However,  when  extending  the 
two-special-k-point  method  to  the  CA  and  CH  struc¬ 
tures,'^  most  of  the  errors  of  the  alloy  and  pure  com¬ 
pound  cancel,  and  we  found  that  the  final  errors  in  the 
excess  energies  are  only  about  0.5  meV  per  atom  pair. 

Before  presenting  the  results,  we  briefly  describe  the 
Keating^"  valence-force-field  (VFF)  model.  The  VFF 
only  deals  with  the  strain  energy.  The  energy  per  unit 
volume  in  a  strained  diamond  or  zinc-blende  crystal  is 
given  by 


oa  0  I  0 


I  >1 


(13) 


where  in  the  first  term,  the  bond-stretching  energy,  the 


summation  i  runs  over  all  the  bonds,  and  in  the  second 
term,  the  bond-angle  contribution,  the  summations  in¬ 
clude  all  the  pairs  of  bonds  that  share  common  atoms. 
The  do  'S  the  equilibrium  bond  length,  and 

A(d, -d^)  is  the  strain-induced  change  of  the  dot  product 
between  the  two  bond  vectors  which  point  from  the  com¬ 
mon  atom  to  the  nearest -neighbor  atoms.  For  a  zinc- 
blende  crystal,  there  is  only  one  value  for  the  bond¬ 
stretching  force  constants  a, —a,  and  one  value  for  the 
bond-angle-restoring-force  constant  Their  values 

are  determined’**  by  fitting  the  experimental  bulk  moduli 
B  and  shear  elastic  constants  C,  1  ^12*  and  are  tabulated 

in  Table  II.  We  extend  Eq.  (13)  to  calculate  the  strain  en¬ 
ergy  in  an  ABCj  alloy  by  treating  each  bond  and  each 
pair  of  the  same  bonds  in  the  same  way  as  in  the  constitu¬ 
ent  compounds.  However,  when  dealing  with  the  bond- 
angle  term  involving  two  unlike  AC  and  BC  bonds,  both 
do  and  /3,y  are  taken  to  be  the  average  values.  The  results 
from  VFF  will  be  compared  with  the  ETB  calculations  in 
Sec.  IV. 


IV.  DISCUSSION  OF  ALLOY  RESULTS 
A.  Excess  energies 

Before  presenting  the  results  for  different  kinds  of  ex¬ 
cess  energies,  the  readers  should  be  reminded  of  the  rela¬ 
tion  between  these  energies  and  ordering.  The  Appendix 
provides  a  qualitative  discussion  of  this  relation.  It  also 
describes  how  different  kinds  of  excess  energies  presented 
below  may  alter  the  picture  of  ordering. 

The  bulk  excess  energies  A£  calculated  from  ETB  for 
the  ABC2  alloys  in  the  three  structures  of  CA,  CH,  and 
CP  are  listed  in  Table  VI  along  with  the  results  from 
VFF  and  LDF.  First  we  observe  that  ETB  and  VFF  give 
very  similar  results.  Both  models  produce  very  small  A£ 
values  for  those  alloys  composed  of  compounds  with 
nearly  equal  lattice  constants.  However,  A£  for  AlGaPj 
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and  HgCdTe2  derived  from  ETB  are  slightly  negative. 
The  differences  between  the  ETB  and  VFF  results  in 
these  systems  are  a  measure  of  the  small  size  of  the 
chemical-energy  contribution  to  AE  in  the  ETB  model. 
For  the  lattice-mismatched  alloys,  all  the  AE  values  from 
ETB  are  positive  and  are  slightly  larger  than  those  ob¬ 
tained  from  VFF  except  for  InAlPi.  The  major  reason 
VFF  yields  a  smaller  AE  is  that  the  VFF  used  here  con¬ 
sistently  predicts  smaller  values  of  the  elastic  constant 
C44  than  the  experimental  values,  thereby  underestimat¬ 
ing  the  strain  contribution  to  AE.  However,  the  small 
chemical  energy  included  in  ETB  may  upset  this  trend, 
exemplified  by  InAlP,. 

The  quantitative  comparison  between  ETB  and  LDF  is 
mixed.  Starting  from  the  lattice-matched  alloys 
GaAlASi,  the  AE  from  LDF  calculated  by  different 
groups  range  from  7.5  to  35  meV  for  the  three  structures 
considered,  as  compared  to  nearly  zero  calculated  from 
ETB.  Similar  differences  between  LDF  and  ETB  also 
occur  in  another  lattice-matched  system,  HgCdTe,. 
Since  the  strain  energy  is  nearly  zero  in  these  systems,  the 
10-meV  or  so  difference  between  the  present  calculation 
and  the  majority  of  the  LDA  results  represents  the 
discrepancy  in  the  estimate  of  the  chemical  energy  in  AE 
between  the  two  theories.  These  differences  certainly  are 
well  within  the  margins  of  errors  of  both  ETB  and  LDF. 
However,  we  note  the  LDA  AE  values  for  GaAlAsi  in 
the  CA  structure  are  rather  consistent  except  the  35  meV 
from  Ref.  56.  For  the  lattice-mismatched  alloys,  ETB 
agrees  very  well  (within  20  meV)  with  LDF  for  the  III-V 
alloys  in  the  CP  structure  and  II-VI  alloys  in  the  CH 
structure.  The  agreement  is  also  reasonable  for  all  alloys 
in  the  CA  structure.  However,  the  differences  between 
the  two  calculations  are  more  substantial  for  the  III-V  al¬ 
loys  in  the  CH  structure  and  II-VI  alloys  in  the  CP 
structure.  We  note  that  although  the  trend 
A£ch  <^£^cp  aiTiong  the  three  structures  holds 

for  both  ETB  and  LDF  for  these  lattice-mismatched  al¬ 
loys,  the  A£ch  values  from  LDF  for  the  III-V  systems 
are  considerably  lower  than  those  from  ETB,  particularly 
the  negative  values  calculated  for  InAlP;  and  InAlASi. 
Despite  all  these  quantitative  differences,  the  qualitative 
trends  of  our  ETB  predictions  are  rather  similar  to  those 
from  LDF  by  Wei  and  co-workers'"  as  shown  in  Fig.  1. 
The  most  important  conclusion  that  can  be  drawn  from 
Table  VI  is  that  these  ordered  bulk  alloys  are  not  the 
thermal  equilibrium  states  at  the  experimental  growth 
temperatures  shown  in  Table  1.  For  this  to  happen,  the 
AE  value  has  to  be  less  than  —200  meV  per  four  atoms,*' 
as  mentioned  earlier.  This  AE  value  is  far  below  all  the 
calculated  values  and  is  beyond  the  uncertainties  of  our 
ETB  model  and  the  LDF  calculations  listed  above. 

While  the  above  excess  energies  do  not  support  a 
thermally  stable  ordered  bulk  alloy  at  the  growth  temper¬ 
ature  (=600°C),  they  may  offer  different  conclusions 
when  applied  to  the  epitaxially  grown  alloys.  The  idea 
was  first  suggested  by  Flynn^^  in  connection  with  epitaxi¬ 
al  growth  of  ordered  metal  alloys.  If  the  substrate  ma¬ 
terial  serves  as  a  reservoir  for  the  epitaxial  film,  then  the 
alloys  grown  on  the  substrate  have  to  be  in  thermal  and 
mechanical  equilibrium  with  the  substrate.  Since  most 


substrates  used  in  the  growth  are  selected  to  have  their 
lattice  constants  close  to  the  equilibrium  lattice  parame¬ 
ters  (see  Tables  1  and  IX)  of  the  ordered  alloys  mentioned 
above,  it  takes  little  strain  energy  for  these  alloys  to 
match  the  substrates.  On  the  other  hand,  if  an  alloy  on 
the  substrate  is  to  decompose  into  two  substrate  lattice- 
mismatched  constituent  compounds,  it  takes  energy  to 
constrain  the  separated  systems.  Only  the  case  of  thin 
layers  in  which  no  dislocations  form  will  be  treated. 
Then  the  reference  constituent  energies  EiAB)  and 
E{  AC)  in  Eq.  (1)  which  mu«t  be  used  to  calculate  the  ex¬ 
cess  energy  for  epitaxial  growth,  referred  to  as  A£,p,  are 
the  ones  with  their  lattice  parameters  matched  to  the  sub¬ 
strate.  This  coherent  strain  can  make  A£,p  negative  for 
an  alloy  even  when  its  bulk  excess  energy  AE  is  positive. 
Of  course,  the  actual  values  of  excess  energies  are  sub¬ 
strate  specific.  However,  an  overall  assessment  can  be 
made  when  the  substrate  is  also  an  alloy  by  choosing  the 
substrate  lattice  constant  to  be  the  average  value 

The  major  difference  between  AE^^  and  AE  then  comes 
from  the  strain  energies  of  the  constituent  compounds 
forced  to  lattice  match  the  substrate.  Since  the  energies 
of  the  ordered  alloys  at  their  equilibrium  lattice  constants 
a  only  differ  by  a  small  amount  (1  or  2  meV)  from  those 
at  a,  we  shall  use  the  alloy  energies  already  calculated  in 
Table  VI  to  deduce  the  epitaxial  excess  energy. 

The  strain  energy  of  either  constituent  compound 
matched  to  a,  =fl  on  the  (001)  substrate  can  be  estimated 
from  the  following  simple  formula: 

£,(/4C)  =  (C,,+2C,,-4C?2/C||  )5^//4  ,  (14) 

where  6o  =  (a^c  — the  percentage  lattice- 
parameter  difference  between  the  two  constituent  com¬ 
pounds.  The  strain  energies  used  to  calculate  the  values 
of  AEfp  in  Table  Vll,  however,  are  obtained  from  ETB, 
which  in  fact  only  produces  a  small  correction  (1  to  2 


AlinP,  GalnF^  Go^AsSb  GotnA^  Hg2nT^  CdZnT^  Co^Aa 


FIG.  1.  The  excess  energies  of  Eq.  (I)  for  the  ordered  alloys 
in  three  crystal  structures  calculated  from  the  present  TB  model 
(solid  circles)  and  from  LDF  in  Ref.  6  (open  circles). 
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meV  at  most)  to  Eq.  (14).  The  relative  values  of  A£,p 
among  the  three  structures  on  the  same  substrate,  say, 
(001),  should  remain  the  same  as  those  of  A£,  because 
these  two  energies  only  differ  in  the  reference  energy. 
For  example,  the  relation 

A£,p  ( CH  )<  A£,p  ( C  A  )<  A£,p  ( CP ) 

still  holds  for  alloys  composed  of  lattice-mismatched  con¬ 
stituent  compounds.  On  the  (001)  substrate,  the  A£^p 
values  are  essentially  all  positive  in  the  CP,  negative  in 
the  CH,  and  switch  between  positive  and  negative  values 
in  the  CA  structure.  The  CP  alloys  matched  to  the  (111) 
substr.ite  have  very  small  A£,p  with  magnitudes  smaller 
than  10  meV.  The  LDF  epitaxial  energies  calculated  by 
Wei,  Ferreira,  and  Zunger*"  have  the  same  qualitative 
trends  as  those  from  ETB  among  the  three  structures. 
The  quantitative  differences  in  A£^p  between  the  two  cal¬ 
culations  are  larger  and  also  more  scattered  than  those  in 
A£.  This  is  peculiar,  because  the  difference  between  A£ 
and  A£,,p  in  both  calculations  comes  from  the  strain  ener¬ 
gies  of  the  constituent  compounds,  which  can  be  reliably 
estimated  from  Eq.  (14)  and  are  well  prescribed  by  our 


ETB  model.  Despite  these  discrepancies,  a  simple  con¬ 
clusion  can  also  be  drawn  from  Table  VII  These  num¬ 
bers  still  cannot  account  for  the  ordering  observed  exper¬ 
imentally,  because  all  the  calculated  A£,p  are  still  well 
above  the  —  200-meV  value  required  for  the  ordering  at 
the  growth  temperatures.'^ 

In  the  above  consideration,  the  constituent  compounds 
are  constrained  to  match  the  lateral  lattice  of  the  sub¬ 
strate  and  are  allowed  to  relax  fully  in  the  growth  direc¬ 
tion.  For  a  (001)  substrate  with  a  lattice  constant  Oj  =3, 
the  c/a  ratio  for  either  compound  is  estimated  to  be 

c/fl  =  H-6y(C|,/C|, -1-0.5)  ,  (15) 

where  again  is  the  percentage  bond-length  difference 
between  the  two  constituent  compounds.  This  c  /a  relax¬ 
ation  results  in  elongation  for  one  compound  and  shrink¬ 
age  for  the  other  along  the  growth  direction.  If  the  lat¬ 
tice  constants  of  two  constituent  crystals  have  a  substan¬ 
tial  difference,  these  opposite  relaxations  will  produce 
strained  grain  boundaries  between  the  AC  and  BC  crys¬ 
tals.  The  reference  state  energy  £(  AC)  +  E {SO  used  in 


T.4BLE  VI.  The  bulk  e.xcess  energies  A£  (in  meV  per  four  atoms)  from  ETB  and  comparison  with  the  results  from  local-density- 
functional  ILDF)  theory  and  the  valence-force-field  (VFF)  model. 


CuAu  1 

Chalcopyriie 

CuPi 

Alloys 

ETB 

LDF 

VFF 

ETB 

LDF 

VFF 

ETB 

LDF 

VFF 

AlGaP; 

-2.4 

1.5 

-2.8 

1.0 

-2.6 

0.2 

AlGaAs; 

0.6 

10.8- 

0.3 

0.6 

11.4'’ 

0.2 

0.8 

7.5' 

0,4 

11.5'- 

9.8' 

15.1'’ 

13.5'’ 

35' 

AllnP; 

69.0 

43.0' 

74.0 

44.0 

-21.0' 

47.3 

114.2 

97.0' 

111.1 

AlInAs; 

68.8 

35.0' 

66.6 

45.8 

-15.0' 

43,7 

107.0 

97.6 

Ga,PAs 

30.0 

26.6' 

23.1 

19.4 

6.5* 

15.0 

33.6 

37.2* 

32.7 

Ga,PSb 

260.2 

207.0 

67.4 

135.8 

290.8 

292.6 

Ga^AsSb 

113.0 

129.2- 

114.8'’ 

115.0' 

91.8 

69.8 

52.0' 

59.9 

128.0 

132.0' 

130.1 

GaInPi 

88.4 

115.6- 

83.8 

57.2 

19.0’ 

54.3 

139.4 

155.4* 

124.9 

91.0’ 

54.4'' 

GalnAS; 

73.2 

60.1’ 

67.3 

48.2 

16.5* 

43.8 

113.0 

108.5* 

99.5 

83.6- 

66.7' 

GalnSb; 

57.4 

51.5 

37.4 

33.2 

85.58 

76.2 

ZnCdTe, 

34.3 

54.2* 

33.7 

22.4 

19.2* 

21.1 

65.3 

103.5* 

j6.0 

ZnHgTej 

29.7 

42.5* 

32.1 

21.0 

11.4* 

19.9 

54.9 

103.3* 

53.5 

CdHgTe, 

-2.3 

12.1* 

0.61 

-2.7 

11.3* 

0.4 

-2.7 

9.8* 

1.1 

■■Reference  28. 
^’Reference  3 1 . 
■■Reference  27. 
■'Reference  55. 
'Reference  56. 
'Reference  53. 


•Reference  6. 
•■Reference  30. 
‘Reference  32. 
’Reference  29. 
‘Reference  54. 
'Reference  74. 
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TABLE  VII.  The  excess  energy  (in  meV  per  four  atoms)  for  epitaxial  alloys  calculated  from  the  present  TB  model  and  com- 
parison  with  the  LDF  results. _ 


Structure 

substrate 

(Alloy) 

CuAu  1 
(100) 

TB 

LDF 

Chalcopyrile 

(100) 

TB  LDF 

(111) 

TB 

CuPl 

(100) 

TB 

(100) 

LDF 

AIGaP; 

-2.9 

-3.1 

0.5 

p 

1 

AlGaAs, 

0.8 

0.7 

0.4 

0.2 

AllnP: 

-  11.5 

-36.0'' 

-36.3 

-0.2 

33.8 

-18.0* 

AlInAs, 

-4.6 

-27.8 

-7.4 

33.6 

GajPAs 

5.3 

4.1* 

-5.3 

-21* 

0.2 

8.9 

0.5* 

Ga,PSb 

40.5 

-52.2 

7.8 

71.1 

Ga^AsSb 

15.5 

35.0*’ 

-27.6 

00 

-5.6 

30.5 

52.0“ 

17.0" 

-45.8' 

11.5* 

48.0" 

GaInPj 

-3.3 

-3.0" 

-34.5 

-106" 

-9.4 

47.6 

3.4* 

12.8" 

GalnAs, 

0.4 

-9.7" 

-25.4 

-9.6 

39.5 

4.1' 

10.5“ 

6.7“ 

8.4“ 

29.6" 

GalnSb; 

1.4 

-18.5 

-5.0 

29.9 

ZnCdTe, 

-5.6 

-5.4" 

-17.5 

-57.2' 

6.6 

25.5 

0.2' 

ZnHgTe, 

-5.9 

-  13.5" 

-14.7 

-64.2* 

3.0 

19.3 

0.9' 

CdHgTe; 

-2.4 

-2.8 

-3.0 

-2.9 

^Reference  6. 
'’Reference  27. 
'■Reference  28. 
‘‘Reference  29. 


the  calculation  of  in  Table  VII  assumed  that  the 

decomposed  AC  and  BC  phases  are  macroscopic  crystals. 
The  domain-wall  energy  contributions  were  neglected  be¬ 
cause  there  are  few  boundaries.  However,  a  realistic  path 
between  the  completely  separated  and  fully  relaxed  i4C 
and  BC  domains  is  likely  to  pass  through  a  sequence  of 
intermediate  states,  including  the  stage  of  forming  micro¬ 
scopic  AC  and  BC  clusters  which  serve  as  nucleation 
centers.  As  an  approximation  to  this  phase  space  im¬ 
mediately  adjacent  to  the  ordered  ABC2  alloy,  we  have 
estimated  the  metastable  nucleation  energy  barrier  by  as¬ 
suming  that  the  microscopic  AC  and  BC  clusters  are  lat¬ 
tice  matched  to  the  ABC2  alloy  and  their  energetics  can 
be  estimated  from  the  bulk  crystals  under  the  same  con¬ 
straint.  Because  the  c/a  ratios  for  the  ordered  alloy  are 
nearly  unity  (see  Table  VIII),  this  epitaxial  energy  against 
nucleation  of  ,4C  and  BC  clusters  is  equivalent  to  using  a 
reference  energy  E{  AC)  +  E(BC)  which  disallows  the 
c  /a  relaxation.  Consequently,  this  epitaxial  excess  ener¬ 
gy  for  the  alloy,  referred  to  as  A£,p  (the  hard  model),  is 
lowered  further  as  shown  in  Table  VIII.  Note  that  the 
values  for  a  number  of  alloys  have  already  attained  mag¬ 
nitudes  that  could  account  for  the  stability  of  the  ob¬ 
served  LRO  of  lattice-mismatched  alloys.  However,  de¬ 
tails  of  the  mechanism  that  causes  the  LRO  to  preferen¬ 
tially  form  in  the  first  place  must  still  be  determined. 


B.  Bond  lengths 

In  addition  to  the  excess  energies,  our  calculations  also 
produce  detailed  information  about  the  equilibrium 
structures  of  the  alloys.  Table  IX  lists  the  c/a  ratio  and 


TABLE  VIII.  The  excess  energy  A£,^  (in  meV  per  four 
atoms)  for  epitaxial  alloys  without  (c/o)-ratio  relaxation. 


CuAu  I 

Chalcopyrite 

CuPt 

AlGaPj 

-3.1 

-3.4 

-3.3 

AIGaAsj 

0.7 

0.6 

0.2 

AllnP; 

-  192.5 

-217.4 

-147.3 

AIInAs2 

-  149.0 

-172.2 

-110.8 

Ga^PAs 

-37.5 

-48.1 

-33.9 

GajPSb 

-354.4 

-447.1 

-323.8 

GajAsSb 

-162.2 

-205.2 

-147.1 

GaInPj 

-200.4 

-231.6 

-  149.5 

GaInAs2 

-149.8 

-174.8 

- 109.9 

GaInSb2 

-113.9 

-133.8 

-85.3 

ZnCdTe2 

-131.6 

-143.5 

-100.5 

ZnHgTe2 

-150.3 

-159.1 

-125.1 

CdHgTe, 

-2.8 

-3.2 

-3.2 
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the  first-neighbor  bond  lengths  for  all  the  alloys  studied. 
These  results  are  particularly  useful  if  experiments  such 
as  EXAFS  are  earned  out  to  measure  the  local  structures 
of  these  LRO  alloys.  As  mentioned  earlier,  the  c  /a  ratios 
are  nearly  unity  for  all  alloys.  The  notations  used  in  the 
table  are  such  that  the  equilibrium  bond  lengths  of  the 


constituent  crystals  are  in  the  order 
bond  lengths  in  the  CA  and  CH  structures  are  in  general 
bimodal,  with  result  is  similar  to  that 

found  in  the  disordered  bulk  alloys.  There  are  four 
different  bond  lengths  in  the  CP  structure:  d^c> 

dg(-,  and  d|c>  where  superscripts  T  and  S  mean  the  triple 


TABLE  IX.  The  equilibrium  bond  lengths  and  dgc  and  the  equilibrium  average  lattice  con¬ 
stants  a  (all  in  A).  In  CP  structure,  the  first  values  d^(;  and  dgc  are  for  those  bonds  along  the  (111) 
direction,  and  the  second  values  are  for  those  in  the  other  three  directions. 


ABC, 

c /a 

a 

^  AC 

dgc 

AlGaAS; 

CA 

1.000 

5.658 

2.450 

2.449 

CH 

1.000 

5.656 

2.450 

2.449 

CP 

0.998 

5.660 

2.451 

2.449 

2.444 

2.451 

AlGaP, 

CA 

1.000 

5.457 

2.365 

2.362 

CH 

1.000 

5.459 

2.366 

2.361 

CP 

0.998 

5.461 

2.371 

2.362 

2.354 

2.366 

InGaSb^ 

CA 

1.009 

6.272 

2.781 

2.669 

CH 

0.995 

6.298 

2.788 

2.660 

CP 

1.001 

6.295 

2.788 

2.762 

2.650 

2.699 

InAIAs; 

CA 

1.013 

5.831 

2.597 

2.479 

CH 

0.992 

5.870 

2.604 

2.469 

CP 

1.001 

5.861 

2.599 

2.578 

2.456 

2.512 

InGaAs; 

CA 

1.012 

5.834 

2.597 

2A1» 

CH 

0.993 

5.866 

2.605 

2.468 

CP 

1.002 

5.859 

2.602 

2.577 

2.457 

2.510 

InAlP; 

CA 

1.007 

5.658 

2.520 

2.395 

CH 

0.996 

5.674 

2.527 

2.384 

CP 

1.001 

5.677 

2.516 

2.504 

2.381 

2.425 

InGaP: 

CA 

1.013 

5.640 

2.518 

2.390 

CH 

0.993 

5.672 

2.526 

i.m 

CP 

1.003 

5.665 

2.520 

2.499 

2.368 

2.425 

AsPGa, 

CA 

1.001 

5.549 

2.430 

I.m 

CH 

0.999 

5.552 

2.437 

2.371 

CP 

0.998 

5.556 

2.450 

2.422 

2.354 

2.390 

SbAsGa, 

CA 

1.003 

5.880 

2.601 

2.499 

CH 

0.998 

5.884 

2.613 

2.482 

CP 

0.995 

5.905 

2.637 

2.585 

2.439 

2.531 

SbPGa, 

CA 

1.008 

5.776 

2.586 

2.435 

CH 

0.994 

5.787 

2.603 

2.407 

CP 

0.993 

5.822 

2.635 

2.567 

2.344 

2.486 

CdHgTe, 

CA 

1.000 

6.471 

2.805 

2.799 

CH 

1.000 

6.471 

2.806 

2.798 

CP 

0.999 

6.473 

2.817 

2.798 

2.788 

2.805 

HgZnTe^ 

CA 

1.010 

6.256 

2.783 

2.656 

CH 

0.995 

6.286 

2.787 

2.650 

CP 

1.003 

6.277 

2.771 

2.770 

2.659 

2.679 

CdZnTe, 

CA 

1.016 

6.252 

2.790 

2.657 

CH 

0.991 

6.302 

2.795 

2.650 

CP 

1.007 

6.275 

2.787 

2.771 

2.647 

2.685 
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TABLE  X.  I  he  ratios  y  =  (t/jn,,.  ~  d  —  d  >  for  the  ordered  alloys  calculated  from  the  present  TB 
model  and  comparison  w  ith  experimental  results  for  the  disordered  alloys. 


ABC: 

r  u 

CA 

ETB 

Vftc 

}' 

CH 

ETB 

Vhc 

r  4(: 

Disordered  alloys 
Experiments 

1'  ec 

InGaSb; 

oil 

0.64 

0.80 

0.76 

0.89- 

0.88- 

InGaAs, 

0.71 

0.66 

0.80 

0.78 

0.77- 

0.80- 

InGaP; 

0.74 

0.67 

0,83 

0.79 

0.80- 

0.76- 

AsPGa. 

0.59 

0.5t 

0.75 

0.75 

0.76- 

0.75- 

HgZnTc; 

0.81 

0.76 

0  86 

0.84 

0.72'’ 

0.73" 

■“Reference  58. 
^Reference  2. 


and  single  bonds,  respectively.  However,  the  alloy  lattice 
constants  a  are  quite  close  to  the  mean  value  a  of  the  con¬ 
stituent  compounds.  A  more  sensitive  measure  of  the  bi- 
modal  distribution,  following  Boyce  and  Mikkelsen,’  is  to 
look  at  the  ratio  y  ac~^ ^  being 

the  average  bond  length,  and  similarly,  yg(^.  Table  X 
compares  these  ratios  for  the  CA  and  CH  structures  with 
the  experimental  values  for  the  disordered  bulk  alloys. 
Note  that  a  value  of  7'  =  1  corresponds  to  the  totally  re¬ 
laxed  case  where  there  is  no  bond  stretching,  whereas 
y=0  corresponds  to  a  rigid  virtual  crystal  where  all  the 
atoms  are  on  the  zinc-blende  crystal  sites.  In  terms  of  the 
valence-force  model,  this  ratio  is  roughly  }'  =  a/(a  +  2/3), 
where  a  and  /3  [see  Eq.  (13)]  are  the  average  values  for  the 
bond-stretching  and  angular-restoring-force  constants  of 
the  two  constituent  compunds.  The  calculated  y  ratios  in 
Table  X  follow  this  trend  (see  the  a  and  P  values  in  Table 
II).  However,  the  values  of  y  show  that  CH  structures 
are  more  relaxed  than  the  CA  structures.  This  result  is 
consistent  with  the  lower  A£  values  in  CH  than  in  CA 
shown  in  Table  VI.  The  ETB  values  of  y  for  GalnAs,, 
GaInPj,  and  Ga^AsP  in  the  CH  structure  are  very  close 
to  the  experimental  values*  for  the  disordered  alloys. 


This  result,  when  correlated  with  the  lower  excess  energy, 
may  suggest  that  the  disordered  alloys  tend  to  favor  a  lo¬ 
cal  configuration  of  the  CH  structure.  However,  the  cal¬ 
culated  y  values  for  GalnSb,  and  HgZnTe,  do  not  corre¬ 
late  well  with  the  experimental  results.  As  a  matter  of 
fact,  the  trend  as  a  function  of  the  p/a  ratio  in  the  exper¬ 
imental  results  is  reversed.  It  would  be  interesting  to  see 
if  these  two  ordered  alloys  do  have  different  bond  lengths 
from  the  disordered  states.  While  the  values  of }'  in  this 
table  range  from  0.6  to  0.9  for  the  lattice-mismatched  al¬ 
loys,  this  trend  does  not  hold  for  the  lattice-matched  al¬ 
loys.  For  example.  Table  IX  shows  that  for  Hg — Te  and 
Cd — Te  bond  lengths  in  HgCdTe,,  both  CH  and  CA 
structures  nearly  retain  their  respective  constituent  crys¬ 
tal  values  (i.e.,  7'  =  1),  which  was  also  seen  in  a  recent  ex¬ 
periment  on  the  bulk  alloy. Finally,  the  bond  lengths 
for  the  CP  structure  are  characteristically  different  from 
those  in  CA  and  CH  structures  The  single  bonds  along 
the  ordering  direction  [111]  tend  to  be  close  to  the  con¬ 
stituent  values  while  the  triple  bonds  in  the  other  direc¬ 
tions  have  less  relaxations  (with  y  values  around  0.5  or 
less). 


TABLE  XI.  Bulk  moduli  (in  10"  dyn/cm’i  of  ordered  alloys  in  three  crystal  structures  calculated  from  the  present  TB  model  and 
the  percentage  deviations  AB  -(B  —  )/R,.  from  the  average  values  R,.  of  the  constituent  compounds. 


^CH 

Bqp 

ARch 

(%) 

ARca 

{%) 

ARcp 

(%) 

AlGaP, 

8.858 

8.858 

8.854 

8.8715 

-0.150 

-0.15 

-0.201 

AlGaAsi 

7,695 

7.693 

7.689 

7.7085 

-0.181 

-0.20 

-0.257 

AllnP, 

7.876 

7.860 

7.774 

7.9235 

-0.605 

-0.08 

-1.882 

AlInAs; 

6.705 

6.691 

6.661 

6.7605 

-0.828 

-1.03 

-  1.475 

Ga2PAs 

8.328 

8.291 

8.294 

8.4165 

-1.046 

-1.50 

-1.457 

GajPSb 

6.584 

6.297 

6.188 

7.4675 

-11.836 

-15.68 

-  17.135 

GaiAsSb 

6.314 

6.198 

6.157 

6.7410 

-6.342 

-8.05 

-  8.662 

GaInPj 

8.007 

8.035 

8.878 

8.1950 

-1.437 

-1.95 

-3.865 

GalnAs, 

6.610 

6.579 

6.508 

6.7420 

-1.961 

-2.42 

-  3.474 

GalnSb, 

5.226 

5.202 

5.156 

5.3115 

-1.607 

-2.07 

-2.923 

ZnCdTej 

4.6105 

4.6035 

4.3375 

4.6500 

-0.85 

-  1.00 

-6.72 

ZnHgTe: 

4.8898 

4.8872 

4.6323 

4.9245 

-0.71 

-0.76 

-5.93 

CdHgTe; 

4.4697 

4.4721 

4.4706 

4.4845 

-0.33 

-0.28 

-0.31 
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C.  Bulk  moduli 

Table  XI  lists  the  results  for  the  bulk  moduli  calculated 
from  ETB.  Also  lifted  are  the  percentage  deviations 
from  the  mean  AB  /B.  Note  that  ail  AB  values  are  nega¬ 
tive.  The  magnitudes  of  AB  are  small  except  for 
Ga^AsSb  and  Ga^FSb  which  have  large  dilferences  in 
both  B  and  d  between  the  constituent  compounds.  Al¬ 
though  the  magnitudes  of  AB  get  larger  for  systems  with 
larger  differences  in  the  bond  lengths,  the  dependence 
does  not  seem  to  be  a  simple  function  of  the  bond-length 
difference.  The  uniformly  negative  AB  values  were  also 
obtained  from  LDF  by  Ferreira  et  al.*^  for  the  Ga.AsSb 
alloys.  One  reason  for  the  negative  values  of  AB,  in  a 
very  qualitative  argument,  is  due  to  the  fact  that  the  bulk 
moduli  of  semiconductors  scales  inversely  as  high  powers 
of  the  lattice  constant,*''  and  at  the  same  time,  the  alloy 
lattice  constant  is  appro.ximated  veil  'y  the  mean 
value — the  Vegard  law.  This  qualitati.e  behavior  also 
comes  out  of  a  simple  VFF  analysis  for  disordered  al¬ 
loys.*’' 

V  .  SUMMARY  AND  CONCLUSION 

This  work  was  motivated  by  our  interest  in  under¬ 
standing  the  mechanism  of  ordering  for  the  semiconduc¬ 
tor  alloys  grown  by  MBE  and  MOCVD.  We  have  ap¬ 
plied  an  empirical  tight-binding  (ETB)  model  to  sys¬ 
tematically  interpolate  the  alloy  total  energies  from  those 
of  the  constituent  compounds.  Since  the  strain  energy 
makes  a  dominant  contribution  to  the  excess  energy  AE, 
particular  attention  has  been  given  the  elastic  properties, 
in  addition  to  the  lattice  constants  and  cohesive  energies. 
Our  calculated  bulk  excess  energies  AE  are  positive  for 
all  alloys  composed  of  lattice-mismatched  compounds, 
and  nearly  zero  for  the  lattice-matched  systems.  Based 
on  these  results,  we  conclude  that  all  the  ordered  semi¬ 
conductor  alloys  found  experimentally  are  not  in  their 
thermodynamic  stable  states  at  the  experimental  growth 
temperatures.  The  same  conclusion  can  also  be  drawn 
from  the  most  recent  LDF  calculations*’’^°  *'“’^-^’“’*’  list¬ 
ed  in  Table  VI.  We  note  that  several  earlier  theories*^”** 
that  concluded  a  stable  ordering  for  these  alloys  have  all 
been  revised''’  *’'  (see  also  comments  in  Ref.  71). 

Our  calculation  also  generates  detailed  information 
about  the  structures  and  bulk  moduli  of  these  alloys. 
These  results  should  be  checked  experimentally. 

To  further  explore  the  stability  of  these  alloys,  we  ex¬ 
amined  the  energetics  for  spontaneous  ordering  when  the 
grown  materials  are  constrained  to  match  a  substrate  lat¬ 
tice.  Two  kinds  of  epitaxial  energies  are  calculated;  One, 
denoted  AE^.^,  corresponds  to  the  situation  in  which  the 
constituent  compounds  are  allowed  to  relax  fully  along 
the  growth  direction;  and  the  other,  the  hard-model 
AE^,  does  not  allow  c/a  relaxation.  Although  the  sign 
of  AEgp  can  be  negative,  the  magnitudes  are  too  small  to 
account  for  the  observed  spontaneous  ordering.  Howev¬ 
er,  the  sign  and  magnitudes  of  AE'^  for  a  number  of 
lattice-mismatched  alloys  are  found  to  be  comparable  to 
the  energies  needed  for  ordering  at  the  growth  tempera¬ 
tures.  Whether  or  not  this  is  a  plausible  mechanism 


deserves  a  more  detailed  investigation. 

Even  if  AE^  turns  out  to  be  a  possible  mechanism, 
based  on  the  considerations  presented  in  this  paper  we 
are  still  left  without  an  explanation  for  the  ordering  in 
the  lattice-matched  alloys  such  as  GaAlAs^.  We  do  not 
believe  that  any  refined  bulk  calculation  will  produce  an 
excess  energy  with  large  enough  magnitude  to  account 
for  the  observed  ordering  in  GaAlAs,.  On  the  other 
hand,  epitaxial  growth  is  very  surface  sensitive.  Because 
of  changes  in  the  bonding  character  at  surfaces,  e  g.,  dan¬ 
gling  bond,  charge  transfer,  and  reconstruction,  the  sur¬ 
face  structural  energies  behave  quite  differently  from 
those  in  the  bulk,  and  do  not  extrapolate  from  the  bulk, 
energies.  New  mechanisms  for  spontaneous  ordenng 
may  emerge  from  surface  energetic  considerations.  Some 
hopeful  thoughts’^  along  this  line  have  already  been  sug¬ 
gested. 
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APPENDIX:  EXCESS  ENERGY  AND  ORDERING 

The  excess  energies  AE,  AE^^,  and  AE^  considered  in 
the  text  for  the  ordered  alloys  have  to  be  considered 
along  with  the  free  energies  of  other  possible  phases  in¬ 
cluding  the  disordered  alloy  in  order  to  determine  if  the 
ordered  phase  is  stable  at  a  given  temperature.  A 
schematic  mixing  free-energy  curve  AF  for  a  disordered 
pseudobinary  alloy  /f  |_^B^C  as  a  function  of  alloy  com¬ 
position  X  at  several  different  temperatures  is  shown  in 
Fig.  2.  Figure  2(a)  is  for  the  case  AE  >  0,  and  Fig.  2(b)  for 
A£<0.  (These  curves  are  similar  to  Fig.  1  in  Ref  57.)  If 
A£>0,  then  the  stable  phase  is  either  the  segregated 
phase  or  a  uniform  disordered  solution  depending  on 
whether  the  temperature  T  is  lower  or  greater  than  the 


(o)  AE  >  0  (b)  AE  <  0 


0  0.5  1  0  0.5  1 

AC  X  BC  AC  X  0C 


FIG.  2.  Schematic  plots  of  the  mixing  free  energy  of  a  disor¬ 
dered  alloy  as  function  of  alloy  composition  x  (dotted  lines)  for 
(a)  AE>0  and  (b)  AE<0.  The  dashed  line  represented  AE  for 
an  ordered  alloy  ABCi- 
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critical  temperature  F,,,  for  ph..se  segregation  or  forming 
miscibility  gaps.  However,  if  A£  <  0,  then  the  ordered 
pha.se  is  stable  at  a  temperature  Flower  than  the  critical 
temperature  for  the  order-disorder  transition. 

A  crude  estimate  of  the  magnitude  of  XE  in  Fig.  2(b) 
required  for  ordering  at  a  given  temperature  F  can  be 
made  by  assuming  that  the  disordered  phase  is  a  random 
alloy.  Then  in  the  present  units  of  energy  (for  an  ABC^ 
molecule),  the  magnitude  of  the  ordering  energy  defined 
as  A£(j=  must  be  greater  than  2A;Fln2, 

where  XE^  is  the  mixing  energy  for  the  disordered  phase 
for  the  case  at  x  =-i  and  F=0,  also  shown  in  Fig.  2(b). 
At  a  typical  epitaxial  growth  temperature  of  600 °C  (see 
Table  I),  the  minimum  ordering  energy  is  estimated  to  be 
XEq  =  100  meV.  Taking  A£p  =  XE/2,  one  finds  a  critical 
value  of  A£  =  — 200  meV.  However,  if  a  strict  pair- 
potential  nearest-neighbor  model  is  used,  XEq  =  ^A£  for 
both  the  CuAu  1  and  chalcopyrite  structures  and  the  re¬ 
quired  A£  value  becomes  —400  meV  in  this  estimate. 


Finally,  consider  the  implication  of  different  excess  en¬ 
ergies  tc  ordering.  Let  us  take  GaiAsSb  in  the  chalcopy- 
nte  structure  as  an  example,  for  which  the  three  excess 
energies  are  70,  —28,  and  —  205  meV'  for  A£,  A£,,^,  and 
XE^,  respectively  (see  Tables  VI,  \’IL  and  VIIL  from  the 
TB  calculation.  The  positive  value  of  A£  corresponds  to 
the  case  of  Fig.  2iai,  which  shows  tha'  the  ordered  bulk 
phase  IS  not  stable  at  any  temperature.  Although  the 
negative  value  A£,,^  -  —28  meV'  corresponds  to  the  case 
III  Fig.  2(b),  the  corresponding  critical  temperature  T.  is 
very  low.  This  implies  a  simple  constraint  that  the  lateral 
lattice  constants  of  '.he  grown  material  matched  to  the 
substrate  lattice  are  not  enough  to  produce  a  meiastable 
ordered  alloy  at  the  experimental  growth  temperature  ot 
600  °C.  However,  the  significant  negative  value  of 
A£,p  =  —205  meV,  resulting  from  a  further  constraint  in 
which  the  c  axis  relaxation  is  not  allowed,  may  raise  the 
F^  in  Fig.  2(b)  into  the  range  of  the  growth  temperature 
and  make  the  meiastable  ordered  phase  a  possibility. 
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ABSTRACT 

Native  point  defects  play  an  important  role  in  HgCdTe.  Here  we  discuss  some  of  the 
relevant  mass  action  equations,  and  use  recently  calculated  defect  formation  energies  to 
discuss  relative  defect  concentrations.  In  agreement  with  experiment,  the  Hg  vacancy  is 
found  to  be  the  dominant  native  defect  to  accommodate  excess  tellurium.  Preliminary 
estimates  6nd  the  Hg  antisite  and  the  Hg  interstitial  to  be  of  comparable  densities.  Our 
calculated  defect  formation  energies  are  also  consistent  with  measured  diffusion  activation 
energies,  assuming  the  interstitial  and  vacancy  migration  energies  are  small. 


INTRODUCTION 

Sophisticated  infrared  devices  are  currently  made  from  HgCdTe.  While  the  material 
has  entered  a  manufacturing  phase,  there  remain  many  unanswered  questions  about  its 
nature.  Native  point  defects  appear  to  play  a  far  more  important  role  in  the  proper¬ 
ties  of  HgCdTe  than  they  do  in  other  technologically  important  semiconductor  materials. 
The  formation  energies  of  these  defects  and  their  associated  localized  states  in  the  band 
gap  are  still  not  completely  characterized.  Similarly,  the  characterization  of  impurities  in 
HgCdTe  is  far  from  complete.  Beyond  point  defects  it  is  well  known  that  a  number  of  ex¬ 
tended  defects  are  important.  The  best  studied  are  tellurium  inclusions  and  dislocations 
which  form  at  heterojunctions  and  at  HgCdTe-substrate  interfaces.  As  grown  material  is 
tellurium-rich,  the  excess  Te  is  accommodated  in  the  form  of  Hg  vacancies  and  tellurium 
inclusions.  As  such,  post  growth  anneals  in  a  Hg  vapor  overpressure  are  typically  per¬ 
formed  to  reduce  the  Hg  vacancies  prior  to  other  device  processing;  this  converts  high 
purity  undoped  material  from  p-type  to  n-type.  The  donor  causing  the  n-type  doping  has 
not  been  identified,  although  it  is  believed  to  be  a  residual  impurity,  rather  than  a  native 
defect. 

One  of  the  more  perplexing  properties  of  HgCdTe  stems  from  the  fact  that  it  is  an 
alloy.  Until  a  few  years  ago  it  was  thought  that  all  tetrahedrally  coordinated  zincblende 
semiconductor  alloys  of  the  form  Ai.^BxC  were  random,  with  the  C  atoms  occupying 
one  fee  sublattice  amd  the  A  and  B  atoms  sharing  the  other  sublattice  at  random.  This 
has  now  been  demonstrated  to  be  incorrect  in  many  semiconductor  alloys.  The  nature 
of  the  correlations  can  be  characterized  by  counting  the  populations  of  those  clusters 
consisting  of  a  centered  C  atom  and  its  four  surrounding  A  or  B  atoms.  There  are  five 
types  of  clusters  of  the  form  A^.^BnC  with  n=0,  1,  2,  3,  or  4.  If  the  alloy  is  random 
the  cluster  occupation  probabilities,  pn,  form  the  binomial  distribution,  i.e.  Pn,r«ndom  ~ 
(n)(l  -  x)^-“x".  In  correlated  material,  deviations  from  this  distribution  are  found.  For 
example,  if  at  x=0.25  the  material  tends  towards  a  regular  compound,  then  pj  is  increased 
while  the  other  cluster  occupation  probabilities  ate  reduced  relative  to  Pn, random-  ^ 
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material  tends  towards  spinodal  decomposition,  then  pg  and  p4  are  enhanced  in  separate 
regions  of  space,  and  the  other  cluster  occupation  probablililies  are  reduced.  In  most 
semiconductor  alloys,  the  major  forces  for  correlations  are  strain  energies  resulting  from 
bond  length  differences  between  the  AC  and  BC  constituents.  Thus  theory  predicts 
that  HgCdTe  will  be  nearly  random  because  HgTe  and  CdTe  are  nearly  lattice  matched. 
Calculations  that  include  chemical  energies  and  charge  shifts,  in  addition  to  the  strain 
e-iergies,  have  also  predicted  HgCdTe  to  be  a  nearly  random  alloy.  Yet  several  recent 
experiments  on  alloys  with  x=0.20-0.2o  have  suggested  that  HgCdTe  is  correlated,  with 
Pn  >  Pn.random  "=0  and  1,  and  p„  <  Po,,»ndom  "=2,  3,  and  4.  If  this  is  the 
case  it  may  have  important  consequences  on  diffusion  and  transport  properties.  The 
consequences  of  such  correlations  on  the  defect  populations  are  discussed  further  below. 


NATIVE  DEFECTS 


The  doubly  ionized  cation  vacancy  is  believed  to  be  the  dominant  native  defect  in 
HgCdTe,  that  dictates  the  electrical  behavior  of  the  undoped  material  (1,  2).  We  have 
recently  reported  the  results  of  calculations  of  native  and  impurity  defect  total  energies  in 
HgTe,  CdTe  and  ZnTe  [3].  The  energies  for  the  formation  of  various  native  point  defects 
in  HgTe  are  summarized  In  Table  I;  the  HgTe  solid  and  the  Hg  in  the  vapor  are  used  as 
references  (4|.  These  calculations  used  the  linearized  muffin  tin  orbital  (LMTO)  method 
within  the  local  density  approximation  (LDA)  to  the  exchange  correlation  energy.  Large 
supercells  containing  one  defect  per  cell  were  repeated  periodically,  and  from  the  differ¬ 
ence  in  total  energies  per  cell,  with  and  without  the  defect,  the  defect  formation  energy 
was  calculated  fo  expedite  the  calculations,  the  energies  were  calculated  within  atomic 
spheres  approximation  (ASA)  with  a  small  (spd)  basis  set.  In  the  ASA,  an  approximation 
to  the  exact  density  functional  is  evaluated:  as  a  result,  an  error  is  introduced  which 
is  larger  than  in  other  LDA  methods  [5],  and  relaxation  energies  cannot  be  accurately 
calculated.  Thus  only  those  differences  in  energies  here  that  are  >  0.5  eV  here  should  be 
viewed  as  significant  for  these  calculations. 

An  appropriate  set  of  mass  action  constants  for  the  neutral  defect  reactions  is  also 
given  in  Table  I.  The  notation  in  that  table  is  as  follows.  Ab  corresponds  to  an  A 
species  occupying  a  B  site,  where  I  corresponds  to  an  interstitial,  and  V  to  a  vacancy.  No 
subscript  on  a  species  indicates  it  is  occupying  the  correct  lattice  site,  e.g.,  Hg  =  Hggg. 
Square  brackets,  [  ],  refer  to  concentrations.  A  subscript  “g"  indicate  the  species  in  the 
gaseous,  or  vapor,  phase  and  Pbj  is  the  Hg  vapor  pressure.  Most  of  the  reactions  in 
Table  I  involve  the  creation  or  destruction  of  one  or  more  unit  cells.  Because  the  resulting 
change  in  volume  is  accommodated  at  the  surface,  the  change  in  the  number  of  unit  cells 
will  enter  into  the  determination  of  the  defect  equilibrium  through  the  surface  entropy. 
Additionally,  surface  preparation  and  orientation  will  affect  the  surface  free  energy.  We 
have  assumed  for  the  present  that  such  surface  effects  are  neglible,  i.e.,  that  the  volume 
expansions  and  contractions  can  occur  with  negligible  changes  in  the  surface  properties. 

To  correctly  complete  the  defect  equilibrium  determination  one  must  include  the  equi¬ 
libration  of  the  electronic  charges  of  the  system.  To  do  so,  one  must  have  knowledge  of 
the  dominant  charge  states  of  the  defects,  and  their  activation  energies  with  respect  to 
the  neutral  defect.  Such  calculations  are  complicated  by  the  fact  that  most  a6  initio 
calculations  of  the  electronic  band  structure  of  semiconductors  predict  an  incorrect  band 
gap,  Eq,  a  shortcoming  of  the  local  density  approximation  (LDA.)  As  such,  we  shall  focus 
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Table  I.  Defect  reactions  and  formation  energies 


Defect  reaction 

Ev„,  +  HgTe-.VagTe  +  Hgg 

+  2HgTe  —  Te^gTe  +  2Hgg 
Exe,  +  HgTe  —  Te;  +  Hgg 
Ev^.  +  Hgg-HgVx. 

^Hg,  +  Hgg  ~  Hgi 


Defect  concentration 

Energy 

(eV) 

2.24- 

[TeHgl  =  Pii'g^X,'=n>(^) 

4.53 

(Tei)  =  P-H‘gA-0._exp(lfpt) 

4.96 

[VxJ  =  Phj  exp(-j^) 

3.12“ 

[HgTel  =  Ph* 

-0.42 

(Hg,l  =  PB*A'&„'=^(4^) 

0.84,  0.98 

’  Experimental  number  from  Vydyanath  (1,  2]. 

**  Calculated  using  tight  binding  Hamiltonian[6]. 


on  the  neutral  native  defects  here,  and  the  established  or  expected  charge  states  of  these 
defects.  In  wide  band  gap  materials  the  defects  equilibration  can  be  substantially  affected 
by  the  Fermi  level;  for  example  the  formation  energy  of  a  donor  will  decrease  when  the 
Fermi  energy  is  near  the  valence  band  edge,  since  the  donor  electron  can  drop  into  a 
vacant  state  near  the  valence  band,  thereby  lowering  the  energy  by  ~  Eq.  Because  we  are 
discussing  HgCdTe  with  a  narrow  band  gap,  we  expect  the  Fermi  effects  to  be  small,  but 
not  insignificant  at  high  temperatures.  Because  of  the  small  conduction  band  effective 
mass,  though,  in  n-type  material  the  filling  of  the  conduction  band  states  by  electrons 
can  shift  the  Fermi  energy  significantly;  combined  with  the  increase  in  the  band  gap  for 
the  high  temperatures  at  which  most  defect  studies  are  done,  the  effective  band  gap  can 
be  significantly  larger  than  the  usual  77  K  bandgap  associated  with  a  given  concentration 
of  HgCdTe. 

First  we  consider  the  defects  which  accommodate  excess  tellurium,  the  first  three 
defects  in  Table  I.  The  mass  action  constants  are  given  by  a  product  of  the  form 


Ki  =  A'?e^  (1) 

where  AE;  is  the  change  in  the  enthalpy  for  the  i***  re!M:tion.  For  the  first  three  equations, 
the  A'p  are  given  by 


=  Co(kT)^(2xm„g)?h-3exp(^^)  , 

(2) 

=  Co(kT)®(2Tm„g)’b-«exp(-J)  , 

(3) 

A'?-,,  =  Co(kT)^2xm„g)?h-3exp(^^)  . 

(4) 
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Here  T  is  the  temperature  in  Kelvin,  k  is  Boltzmann's  constant,  is  the  mass  of  the 
mercury  vapor  atoms,  h  is  Planck's  constant,  AS,  is  the  change  in  vibrational  entropy 
upon  formation  of  the  defect,  and  Co  converts  from  site  fraction  to  volume  concentrations. 
Because  two  unit  cells  of  HgTe  are  destroyed  when  a  tellurium  antisite  is  created,  compared 
to  one  unit  cell  when  a  mercury  vacancy  is  created,  we  do  not  expect  that  exp((  - 

ASy  )/kT)  ~  1.  While  we  have  not  completed  the  evaluation  of  these  entropy  terms, 

our  preliminary  estimates  indicate  this  ratio  is  ~  10'^.  For  the  tellurium  interstitial  and 
the  mercury  vacancy  we  expect  that  xp((AS-j-^  -  ASy^  )/kT)  ~  1  will  be  correct  within 
a  factor  of  10.  Evaluating  the  numerical  constants,  we  find 

=  10-‘0  (5) 

and 

(6) 

for  T=500  C  and  Pg^  =  1  atm.  The  conclusion  from  Equations  5  and  6  that  the  mercury 
vacancy  is  the  dominant  native  defect  is  consistent  with  experimental  observation.  This 
conclusion  is  unchanged  if  we  include  the  possibility  that  the  species  may  be  ionized  at 
the  equilibration  temperature  where  the  material  is  expected  to  be  intrinsic.  Although 
the  tellurium  antisite  density  decreases  more  rapidly  with  decreasing  Hg  pressure  than 
does  the  mercury  vacancy  density,  the  point  at  which  the  concentrations  are  comparable 
is  at  less  than  Pfjg  ~  10"'®  atm,  and  certainly  the  HgTe  phase  boundary  is  reached  before 
such  low  Hg  pressures  can  be  achieved.  This  is  also  consistent  with  the  fact  that  no  p- 
to-n  conversion  is  observed  for  low  mercury  pressures,  as  would  be  expected  if  tellurium 
antisites  became  the  majority  native  defect.  Because  the  pressure  dependences  of  the 
tellurium  interstitial  and  the  mercury  vacancy  concentrations  are  the  same,  the  above 
conclusion  will  hold  independent  of  the  mercury  pressure. 

We  have  checked  the  sensitivity  of  the  calculated  concentration  ratios  to  the  magnitude 
of  the  reaction  enthalpy.  Because  the  enthalpies  enter  in  the  exponents,  small  changes 
in  the  enthalpies  will  result  in  large  changes  in  the  predicted  defect  concentrations.  For 
example,  let  us  assume  that  our  calculated  antisite  formation  enthalpy  is  in  error  by  0.5 
eV;  in  this  case  the  ratio  of  antisite  to  vacancy  concentrations  (at  500  C)  will  be  reduced 
to  ~  1  10"".  For  an  antisite  formation  enthalpy  in  error  by  1.0  eV,  this  ratio  is  reduced 
to  ~  1  10"'.  We  do  not  expect  the  ASA  errors  to  exceed  0.5  eV  [5]. 

If  the  HgCdTe  is  not  completely  annealed,  and  tellurium  precipitates  are  still  present, 
the  defect  equilibrium  will  not  be  that  predicted  by  the  mass  action  equations  given  in 
Table  I.  For  example,  near  the  inclusions  we  can  assume  that  the  defects  will  be  nearly 
in  equilibrium  with  the  tellurium  solid;  thus 

E'v„,  +  Te,-V„gTe  (7) 

and 

E'Te„.  +  2Teii-Te„gTe  (8) 

will  be  the  appropriate  reactions.  The  formation  energies  for  a  tellurium  antisite  and  a 
Hg  vacancy  from  the  tellurium  solid  are  1.63  eV  and  1.15  eV,  respectively.  Although  the 
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difference  in  the  formation  energies  isless  than  when  both  defects  are  referenced  to  the 
mercury  vapor  (~  0.5  eV  compared  to  —  2  eV),  the  gas  phase  entropy  factor  does  not 
enter  into  the  ratio  of  the  defect  concentrations.  Using  the  same  estimate  of  the  entropy 
ratio,  the  defect  concentration  ratio  using  tellurium  solid  as  the  reference  state  is 


Thus,  near  the  inclusions  we  expect  higher  relative  concentration  of  tellurium  antisites, 
as  compared  to  the  rest  of  the  material  equilibrated  with  the  Hg  vapor.  Additionally,  the 
absolute  defect  concentrations  may  differ  substantially  in  the  two  regions  of  the  material. 
A  better  calculation  of  the  vibrational  entropy  is  needed  before  we  can  predict  these  abso¬ 
lute  defect  concentrations.  Differences  in  the  defect  concentrations  arising  from  different 
equilibration  condition  are  a  possible  source  of  spatial  variability  of  the  HgCdTe  material. 
If  the  material  is  not  fully  annealed  to  equilibrium,  for  example  because  of  an  abundance 
of  tellurium  precipitates,  this  history  may  affect  subsequent  processing. 

In  the  above  we  have  discussed  the  defect  energies  for  HgTe  and  applied  them  directly 
to  the  small  x,  system.  Because  we  are  dealing  with  the  native  defects  of  an 

alloy  we  expect  a  number  of  complexities  to  affect  the  above  analysis.  First  the  variation 
of  the  defect  formation  energies  for  vacancies  is  sensitive  to  the  alloy  environment,  in 
particular  for  the  vacancies  of  the  nonsubstituted  species,  such  as  tellurium  in  HgCdTe 
[6].  Even  for  vacancies  of  the  substituted  species,  we  have  found  the  formation  energies 
may  vary  by  several  tenths  of  an  electron  volt.  Because  of  this  variation  in  the  formation 
energy,  the  fraction  of  defective  sites  will  vary  by  as  much  as  a  factor  of  100  from  one 
class  of  sites  to  the  next.  Consider  various  classes  of  Hg  sites  in  ideal  HgCdTe,  which 
can  be  distinguished  to  first  order  by  specifying  the  number  of  Hg  and  Cd  atoms  in  the 
second  neighbor  shell  (the  four  first  neighbors  are  always  tellurium),  Hg]2.jCdj,  with  a 
concentration  given  by  [j].  The  total  vacancy  concentration  is  given  by 

j=: 

where  Ej  is  the  vacancy  formation  energy  for  the  j***  cluster.  The  populations  of  vacancies 
in  each  class  of  cluster,  j,  can  be  expected  to  differ  because  of  differences  in  the  cluster 
populations,  [jj,  and  the  formaiion  energies,  Ej.  Additionally,  the  defect  energy  levels 
may  differ  in  the  various  classes  of  sites,  possibly  leading  to  different  ionization  states  for 
vacancies  in  different  classes  of  sites.  If  the  cations  in  the  alloy  axe  randomly  arranged, 
such  differences  may  be  difficult  to  infer  experimentally.  If,  on  the  other  hand,  the  cations 
are  correlated,  exhibiting  short  range  order,  more  complex  behavior  may  be  present. 
Such  short  range  order  has  been  demonstrated  using  Raman  scattering  [7.  8],  infrared 
reflectivity  [9],  and  nuclear  magnetic  resonance  (lOJ.  In  these  cases,  the  contribution 
to  the  vacancy  densities  from  the  dominant  class  of  clusters  will  be  increased.  Because 
the  studies  finding  short  range  order  focus  on  the  tellurium  centered  5-atom  clusters 
of  the  form  Hg4_„Cdn  rather  than  on  cation  centered  clusters  of  the  form  Hgij.nCtIni 
higher  level  5-atom  cluster-cluster  correlations  must  be  known  to  predict  the  effects  on 
the  vacancy  populations. 

Next  we  examine  the  defects  which  accommodate  excess  Hg  in  the  solid.  The  existence 
region  for  HgCdTe  is  always  tellurium  rich  and  thus  the  native  defects  which  accommodate 
excess  tellurium  are  expected  to  dominate.  For  these  equations  in  Table  I,  /fP  is  given  by 
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A'O^  =Cs‘(kT)-i(2xmH,)-ihJexp(^^)  .  (11) 

A'&^,  =  C5*(kT)-^2rmej)-*h*exp(^^)  ,  (12) 


If  we  assume  the  change  in  entropy  is  comparable  for  all  three  defects,  we  find 


and 


(14) 


tg.l0>14  ,  (15) 

for  T=500  C  and  Phj  =  1  atm.  From  Equations  (14)  and  (15)  we  see  that  the  tellurium 
vacancy,  [Vj^j,  is  a  minority  defect  species.  For  the  pressure  and  temperature  considered, 
the  density  of  Hg  antisiles  is  predicted  to  be  comparable  to  the  density  of  Hg  interstitials. 
Because  the  ratio  of  (Hg-j-^j  to  [Hgjl  is  nearly  unity,  any  errors  in  the  calculation  of  the 
activation  energy  could  push  the  balance  toward  one  side  or  the  other.  Thus  we  must 
depend  on  the  next  generation  of  calculations,  with  the  ASA  removed  and  full  relaxation 
included  plus  a  quantitative  comparison  of  the  entropy  differences  between  the  mercury 
antisite  and  the  tellurium  interstitial,  to  determine  the  dominant  defect  in  this  class. 

As  mentioned  above,  we  have  shown  that  the  tellurium  vacancy  formation  energy 
varies  significantly  with  the  alloy  environment.  Because  the  tellurium  vacancy  is  not 
expected  to  be  a  dominant  defect  in  HgCdTe,  and  the  tellurium  diffuses  by  an  interstitial 
mechanism,  we  do  not  expect  any  measurable  manifestation  of  this  variation.  On  the 
other  hand,  the  Hg  antisite  may  be  the  dominant  Hg-excess  defect,  and  its  formation 
energy  may  vary  significantly  with  the  alloy  environment.  We  are  currently  calculating 
the  magnitude  of  this  variation. 


DIFFUSION 

HgCdTe  exhibits  a  complex  tracer  diffusion  profile,  with  both  a  fast  and  a  slow  branch. 
The  fast  branch  is  attributed  to  a  vacancy  and  interstitial  diffusion  in  parallel  mechanism 
where  the  dominant  diffuser  is  determined  by  the  pressure  and  temperature,  while  the 
slow  component  fits  a  vacancy  and  interstitial  in  series  mechanism  [1 1].  The  activation 
energies  for  the  fast  branch  are  2.10  eV  and  0.61  eV  for  the  vacancy  and  interstitial 
mechanisms,  respectively.  Our  calculated  formation  energy  for  the  mercury  interstitials 
are  0.89  and  0.98  eV  for  the  anion-  and  cation-surrounded  tetrahedral  interstitial  sites, 
respectively,  and  the  experimental  formation  energy  for  the  mercury  vacancy  is  2.24  eV. 
Comparing  these  energies  to  the  experimental  activation  energies  we  find  close  agreement, 
indicating  that  the  migration  energy  contribution  to  the  diffusion  activation  energies  are 
small  for  both  interstitials  and  vacancies. 
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In  a  recent  experiment  on  mercury  diffusion  in  ion-implantation  damaged  HgCdTe, 
an  activation  energy  of  several  tenths  of  an  electron  volt  was  measured  [12].  The  dis¬ 
parate  result  can  be  interpreted  as  a  measure  of  only  the  defect  migration  contribution 
to  the  diffusion  activation  energy,  since  defects  in  excess  of  the  equiUbrium  concentration 
were  Ukely  formed  during  implantation.  It  is  not  evident  that  the  measured  activation 
energy  corresponds  to  the  vacancy  or  the  interstitial  mechanism.  The  conclusion  that 
the  diffusion  activation  energies  are  largely  defect  formation  energies,  with  the  migration 
energies  being  much  smaller,  is  in  agreement  with  the  above  interpretation  of  the  Richter 
and  Kalish  [12]  experiment. 

CONCLUSIONS 

We  have  incorporated  our  calculated  defect  energies  into  the  mass  action  equations 
for  the  neutral  defects  in  HgCdTe.  In  agreement  with  experiment,  we  find  the  mercury 
vacancy  to  be  the  dominant  native  defect  in  tellurium  rich  material.  We  also  find  the 
mercury  antisite  and  interstitial  defect  densitities  to  be  comparable,  although  a  better 
calculation  of  the  vibrational  entropy  is  needed  to  confirm  this  result.  Comparing  the 
defect  formation  energies  to  the  diffusion  measurements  by  Tang  and  Stevenson  [11], 
we  find  agreement  with  their  diffusion  activation  energies  for  both  the  vacancy  and  the 
interstitial  mechanism,  if  we  assume  that  the  migration  energy  is  small.  Further  work  is  in 
progress  to  incorporate  the  defect  charge  states  into  the  calculation,  and  also  to  calculate 
the  fully  relaxed  defect  energies  with  the  full  potential  LMTO. 


The  work  was  supported  by  NASA  contract  NASl-18226,  by  ONR  contract  N00014- 
88-C0096,  and  by  AFOSR  contract  F49620-88-K-0009. 
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We  present  results  for  the  surface  binding  energies  for  HgTe  and  CdTe  that  will  serve  as  input  for 
molecular  beam  epitaxy  grow'th  models.  We  have  found  that  the  surface  binding  energies  are 
surface  orientation  dependent  and  are  not  simply  proportional  to  the  number  of  first-neighbor 
bonds  being  made  to  the  underlying  layer.  Moreover,  because  of  the  possibility  of  charge  transfer 
between  cation  and  anion  surface  states,  one  may  have  large  differences  between  the  binding 
energy  for  the  first  and  the  last  atom  in  a  given  layer,  and  these  differences  will  be  different  for  the 
narrow-gap,  less  ionic  materials  than  for  the  wide-gap,  ionic  matenals.  We  also  find  that  the 
surface  states  associated  with  an  isolated  surface  atom  or  vacancy  are  extended  m  materials  with 
small  gaps  and  small  effective  masses,  and  thus  call  into  question  the  modeling  of  surface  binding 
by  simple  pair  interactions. 


While  considerable  advances  have  been  made  in  recent  years 
in  molecular  beam  epitaxy  (MBE)  growth  of  HgCdTe, 
modeling  of  the  growth  process  is  still  primitive  because  of 
the  complexity  of  the  process.  One  important  input  common 
to  all  growth  models  is  the  binding  energy  of  atoms  to  the 
particular  growth  surfaces.  These  energies  are  usually  ap¬ 
proximated  as  being  equal  to  the  number  of  first-neighbor 
bonds  made  to  the  surface  times  the  energy  per  bond  in  the 
bulk.  We  have  shown '  that  this  is  in  fact  a  very  poor  approxi¬ 
mation,  and  that  the  surface  binding  energies  often  differ 
considerably  from  those  estimates,  and  are  sensitive  to  the 
local  surface  stoichiometry.  In  this  article,  several  implica¬ 
tions  of  these  results  on  the  nature  of  the  growth  surface  of 
HgTe  and  CdTe  and  their  alloys  are  discussed. 

We  have  calculated'  the  surface  binding  energies  in  two 
limits.  The  first  is  ihii  concentrated  limit  in  which  the  atom  is 
added  to  complete  the  surface  layer,  resulting  in  an  ideal 
surface.  The  second  is  the  dilute  limit  in  which  an  isolated 
atom  is  added  to  an  underlying  ideal  surface;  this  limit  corre¬ 
sponds  to  the  initiation  of  a  new  layer.  The  surface  binding 
energies  were  calculated  using  a  tight-binding  Green’s  func¬ 
tion  technique  that  has  been  presented  elsewhere.’ '  Surface 
binding  energies  of  the  constituents  in  these  two  limits  for 
HgTe  and  CdTe  were  calculated  for  the  ( 1 1 1  )B  and  ( 100) 
surfaces.  Results  are  summarized  in  Table  I.  Also  shown  for 
reference  are  the  experimental  bulk  energies  per  bond. 

The  results  in  Table  I  demonstrate  that  surface  binding 
energies  are  not  proportional  to  the  number  of  bonds  made 
to  the  surface.  Because  no  in-plane  bonds  are  made  on  the 
(111)  and  (100)  growth  surfaces,  this  is  evident  by  the  fact 
that  the  binding  energies  in  the  dilute  ( E,, )  and  concentrated 
(£,  )  limits  are  not  in  general  equal,  as  they  would  be  if  the 
simple  proportionality  relationship  held.  Even  when  the  di¬ 
lute  and  concentrated  energies  are  equal,  as  in  the  case  of 
tellurium  on  the  HgTe  ( 1 1 1  )B  surface,  these  energies  are 
not  equal  to  three  times  the  bulk  energy  per  bond.  Although 
the  surface  binding  energies  are  not  linearly  proportional  to 
the  number  of  bonds  being  made  to  the  surface,  we  do  find 
that  averaging  over  the  dilute  and  concentrated  cation  and 
anion  binding  energies  for  a  particular  surface  yields  the  co¬ 
hesive  energy  per  bulk  layer,  as  it  should. 


Differences  in  the  and  can  be  attributed,  in  part,  to 
charge  transfer  effects  on  the  surface.'  ’  Briefly,  because  the 
anion  surface  states  lie  at  lower  energies  than  the  cation  sur¬ 
face  states,  surface  cations  will  always  transfer  charge  to  sur¬ 
face  anions,  until  all  the  surface  anion  states  are  fully  occu¬ 
pied.  Thus,  for  example,  for  cadmium  on  the  ( 100)  surface, 
differences  in  the  binding  energies  result  because  in  the  di¬ 
lute  limit  the  cation  added  to  the  surface  can  transfer  charge 
to  the  surface  anions,  thereby  lowering  the  Cd  surface  bind¬ 
ing  energy,  while  in  the  concentrated  limit,  no  surface  anion 
states  are  available  into  which  to  transfer  charge.  In  con¬ 
trast.  in  the  concentrated  limit  electrons  must  be  promoted 
out  of  the  anion  surface  states  (which  had  been  transferred 
there  from  other  cations  already  present  on  the  surface) 
when  the  cation  completing  the  layer  is  added  Both  of  these 
result  in  a  stronger  binding  of  the  cation  in  the  dilute  limit,  as 
seen  in  Table  I.  This  charge  transfer  effect  will  be  largest  in 
ionic  semiconductors  with  large  band  gap  because  of  the 
large  difference  in  anion  and  cation  surface  state  energies. 

From  Table  I  we  note  that  on  both  the  (111  )B  and  ( 100) 
surfaces  mercury  is  more  weakly  bound  in  the  dilute  limit 
than  in  the  concentrated  limit.  As  a  consequence,  the  initia¬ 
tion  of  a  new  layer  of  mercury  atoms  will  be  more  difficult 
than  the  completion  of  a  mercury  layer,  and  thus  initiation  of 
islands  in  layer-by-layer  growth  will  be  disfavored  with  re¬ 
spect  to  the  completion  of  islands.  The  Te  is  strongly  bound 
to  both  the  ( 1 1 1  )B  and  ( 100)  surfaces,  indicating  that  the 
tellurium-stabilized  growth  will  be  preferred. 

For  CdTe.  we  see  that  the  nucleation  of  a  new  layer  will 
not  be  as  troublesome  as  in  HgTe,  as  indicated  by  the  large 
values  for  E,,  in  Table  I,  while  the  completion  of  the  surface 
will  be  more  difficult.  This  has  an  important  consequence  on 
the  nature  of  the  growing  surface.  Because  E,,  <  £,  for  Cd  in 
CdTe,  the  cations  on  the  surface  prefer  to  nucleate  new  lay¬ 
ers,  as  opposed  to  completing  layers.  This  can  be  interpreted 
as  an  effective  repulsive  surface  interaction  among  the  Cd 
atoms  on  the  surface.  This  may  have  important  conse¬ 
quences  on  the  growth  of  this  and  other  wide-gap  ionic  mate¬ 
rials  such  as  ZnTe.  Because  the  surface  atoms  prefer  not  to 
sit  adjacent  to  one  another,  there  is  the  possibility  of  the 
atoms  on  the  surface  forming  a  superlattice  with  the  surface 
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Table  I.  Surface  binding  energies  for  P;,Te  and  CdTe  (eV) 


(IIDB 

(100) 

Bulk 

Dilute 

Concen¬ 

trated 

Dilute 

Concen¬ 

trated 

HgTe 

Hg 

~0  0 

-  0  3 

~0  0 

-  10 

-  0  82 

Te 

-  2.8 

-  2.8 

-  24 

-  3.1 

CdTe 

Cd 

—  2.2 

-  13 

-  2  1 

-  0  1 

-  1  10 

Te 

-  5.2 

-  2  1 

-  5.0 

-  2  t) 

vacancies.  This  phenomenon  has  been  predicted  and  ob¬ 
served  on  GaAs,  where  the  same  mechanism  is  responsi¬ 
ble.^ '  The  nature  of  this  surface  superlattice  will  depend  on 
the  particular  surface,  the  magnitude  of  the  energy  differ¬ 
ences  between  Ej  and  ,  and  the  temperature  of  the  grow¬ 
ing  surface. 

Because  the  surface  binding  energies  are  dependent  on  the 
surface  stoichiometry  in  so  far  as  it  controls  the  states  avail¬ 
able  for  charge  transfer,  we  expect  the  contributions  to  sur¬ 
face  binding  energies  to  change  relatively  abruptly  at  some 
specific  surface  coverage.  As  an  example  we  consider  the 
addition  of  Cd  to  the  ( 100)  surface  of  CdTe.  As  discussed 
above,  w  hen  an  isolated  Cd  atom  is  added  to  the  surface,  the 
binding  energy  is  lowered  because  of  the  charge  transfer  to 
adjacent  Te  atoms,  w  hile  when  Cd  is  added  a  to  nearly  com¬ 
plete  Cd  surface,  the  binding  energy  does  not  benefit  from 
such  a  charge  transfer  because  there  are  no  empty  Te  surface 
states.  At  50%  surface  coverage,  the  number  of  Cd  and  Te 
surface  states  will  be  equal,  with  the  lower-energy  Te  states 
being  completely  filled  and  the  higher-lying  Cd  states  being 
empty.  Now  we  consider  an  arbitrary  intermediate  surface 
coverage  to  determine  if  the  Cd  surface  binding  energy  will 
benefit  from  a  charge  transfer  to  adjacent  Te.  If  the  surface 
coverage  is  below  50%,  empty  Te  states  will  still  be  available 
for  charge  transfer,  while  above  50%  coverage,  all  Te  states 
will  be  already  full.  Thus,  near  50%  we  expect  the  value  of 
the  surface  binding  energies  to  change  rather  abruptly,  with 
binding  energies  below  50%  coverage  being  closer  to  E^  and 
binding  energies  for  coverages  above  50%  being  closer  to  E^. 
The  situation  is  similar  for  other  polar  surfaces.  For  exam¬ 
ple,  for  addition  of  Te  to  the  ( 1 1 1  )B  surface  the  changeover 
between  dilute-like  and  concentrated-like  binding  energies 
occurs  at  75%  coverage.  This  is  because  the  density  of  sur¬ 
face  states  per  cation  on  the  surface  is  three  times  the  density 
of  surface  states  per  anion.  Thus,  when  the  surface  is  75% 
anions,  the  density  of  cation  and  anion  surface  states  will  be 
equal,  and  full  charge  transfer  will  occur.  This  is  similar  to 
the  results  found  by  Chadi^  and  Tong  et  al.'  for  GaAs. 

Because  the  charge  transfer  depends  on  the  band  gap  of 
the  material,  we  expect  the  behavior  in  HgCdTe  and 
HgZnTe  with  band  gaps  in  the  infrared  to  have  effective 
attractive  surface  interactions.  This  is  consistent  with  pre¬ 
liminary  calculations  using  the  supercells/slab  method.'’ 
Thus,  at  sufficiently  low  temperatures,  although  higher  than 
temperatures  for  which  surface  diffusion  is  too  slow  for  sur¬ 
face  equilibration,  the  surfaces  will  grow  via  the  formation  of 
smooth  islands  of  like  atoms.  As  the  surface  temperature  is 


raised,  entropy  will  dominate  and  the  surface  will  be  rough, 
with  smooth  islands  being  replaced  by  disordered,  randomly 
arranged  atoms. 

The  lateral  extent  of  the  surface  states  produced  by  an 
isolated  surface  atom  or  vacancy  is  found  to  differ  for  HgTe 
and  CdTe.  This  has  been  demonstrated  using  the  same  tight- 
binding  Hamiltonian  in  a  slab  calculation®  and  in  a  Green  s 
function  calculation.  In  the  slab  calculation,  surfaces  are 
modeled  by  constructing  multiple  layers  of  the  semiconduc¬ 
tor  and  vacuum,  where  the  number  of  vacuum  layers  is  large 
enough  to  completely  decouple  the  two  surfaces  and  the 
number  of  semiconductor  layers  is  chosen  large  enough  so 
that  the  center  layer  looks  bulk  like.  In  the  Green  s  function 
method,  a  truly  semiinfinite  surface  can  be  created.  Al¬ 
though  the  slab  method  does  not  mimic  a  real  surface  as  well 
as  the  Green's  function  calculation,  it  permits  the  calcula¬ 
tion  of  the  formation  energy  for  a  periodic  array  of  defects 
To  examine  the  in-plane  coupling  of  the  vacancies,  a  regular 
array  of  Hg  atoms  was  removed  from  the  (lll)A  HgTe 
surface,  where  each  surface  vacancy  created  was  completely 
surrounded  by  atoms.  This  removal  energy  per  atom  is  ex¬ 
pected  to  be  equal  to  —  .  if  the  wave  functions  are  local¬ 

ized  at  the  removal  site.  However,  we  find  that  the  energy 
required  ( per  atom  from  a  unit  cell  of  four  atoms )  to  remox  e 
a  regular  array  of  Hg  atoms  from  the  (111)  HgTe  A  surface 
differs  substantially  from  —  E^ .  When  the  size  of  the  super¬ 
cell  is  increased  to  include  nine  atoms,  the  calculated  energy 
(per  atom)  is  larger  than  the  previous  one  but  still  smaller 
than  f, .  This  is  because  the  created  surface  vacancies  are  far 
from  each  other  and  consequently  couple  less  when  com¬ 
pared  to  the  previous  case.  However,  when  these  calcula¬ 
tions  were  repeated  for  removal  of  cations  from  CdTe  and 
ZnTe  surfaces,  the  removal  energies  calculated  with  nine 
atoms  per  unit  supercell  agreed  exactly  with  £, .  It  suggests 
that  the  surface  wave  function  is  well  extended  on  HgTe 
surfaces,  but  terminates  near  the  third  neighbor  on  CdTe 
and  ZnTe  surfaces.  Whenever  surface  wave  functions  have  a 
large  spatial  extent,  coupling  substantially  to  one  another, 
the  removal  energies  per  atom  will  be  affected. 

The  difference  in  spatial  extent  on  HgTe  and  CdTe  sur¬ 
faces  can  be  understood  from  the  complex  band  structure  of 
these  materials.  The  complex  band  structure  is  the  relation¬ 
ship  between  real  energy  and  complex  states’  and  connects 
the  real  bands  through  the  forbidden  gap  in  the  complex 
wave  vector  plane.  In  the  region  of  fundamental  gap,  the 
complex  bands  start  from  valence  and  conduction  band 
edges  and  meet  at  a  branch  point  in  the  complex  plane.  The 
extent  of  the  wave  function  decreases  exponentially  with  the 
magnitude  of  the  imaginary  part  k,  of  this  branch  point. 
When  the  hole  ma.ss  is  much  larger  than  electron  mass,  it  can 
be  found  that  k,  is  directly  proportional  to  conduction  band 
effective  mass  and  band  gap.  Consequently,  for  HgTe  in 
which  both  band  gap  and  electron  mass  are  very  small,  the 
surface  wave  functions  are  well  extended.  In  the  case  of 
CdTe  and  ZnTe,  with  large  gap  and  large  effective  mass,  the 
calculated  k,  is  large  and  wave  function  is  less  extended. 

We  conclude  by  noting  several  features  of  the  surface 
binding  energies  that  must  be  incorporated  into  any  MBE 
growiii  model.  First,  surface  binding  energies  are  surface 
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orienlation  dependent  and  are  not  simply  proportional  to  the 
number  of  first-neighbor  bonds  being  made  to  the  underly¬ 
ing  layer.  Moreover  because  of  the  possibility  of  charge 
transfer  between  cation  and  anion  surface  states,  one  may 
have  <  £,.  or  >  E^ ,  where  the  former  will  occur  mostly 
for  wide-gap  ionic  compounds  and  the  latter  for  narrow-  (or 
zero)  gap,  less  ionic  compounds.  Finally,  because  the  sur¬ 
face  states  associated  with  an  isolated  surface  atom  or  va¬ 
cancy  are  extended  in  materials  with  small  gaps  and  small 
effective  masses,  the  approximation  of  surface  binding  using 
simple  pair  interactions  is  highly  suspect.  We  expect  that  the 
narrow-gap  semiconductors  HgCdTe  and  HgZnTe  will  be¬ 
have  more  like  HgTe  than  CdTe 
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The  energies  to  remove  atoms  from  various  surfaces  of  semiconductors  have  been  calculated  using  a  new 
Green's  function  formalism.  By  comparing  the  energy  to  remove  atoms  from  a  nearly  completed  (con¬ 
centrated)  surface,  Ec,  and  from  a  nearly  empty  (dilute)  one,  Ed.  we  can  deduce  the  surface  interaction 
energy.  We  find  that  Ec  and  Ed  differ  significantly  even  when  there  are  no  first-neighbor  bonds.  Because 
an  important  contribution  comes  from  charge  transfer.  Ec  -  Ed  is  positive  (indicating  attractive  interac¬ 
tions)  for  pure  covalent  and  narrow  gap  materials,  and  may  become  negative  (repulsive  surface  inter¬ 
actions)  for  wide-gap  ionic  materials.  For  attractive  interactions.below  a  critical  temperamre,  Tc,  the 
equilibrium  surface  atoms  collect  in  islands  and  two-dimensional  layer-by-layer  growth  can  occur.  Above 
Tc,  rough  three-dimensional  growth  is  the  rule.  When  Ec  -  Ed  is  negative,  there  is  also  a  critical  tempera¬ 
ture,  TJ,  below  which  the  atoms  and  vacancies  arrange  into  regular  arrays.  Calculated  energies  and 
temperatures  are  given  for  Si,  GaAs,  CdTe  and  HgTe  on  the  (1 1 1),  (U  i),  (1 10),  and  (100)  surfaces. 


1.  INTRODUCTION 

Propenies  of  semiconductor  surfaces  during 
epitaxial  growth  are  sensitively  dependent  on  the 
nature  of  the  interactions  between  surface  atoms. 
When  the  interaction  between  the  atoms  is  attrac¬ 
tive,  the  surface  grows  with  formation  of  islands  at 
temperatures  below  a  critical  temperature,  Tc  . 
known  as  roughness  transition  temperature.  How¬ 
ever,  when  the  interaction  between  the  atoms  is 
repulsive,  the  surface  undergoes  an  order-disorder 
transition.  At  growth  temperatures  below  the 
order-disorder  transition  temperature,  TJ,  surface 

atoms  and  vacancies  arrange  themselves  in  a 
superlattice  pattern  that  minimizes  the  number  of 
atom-atom  pairs.  Regardless  of  the  nature  of  the 
interaction,  the  sites  are  occupied  uniformly  at 
growth  temperatures  well  above  the  corresponding 
critical  temperature. 

Growth  by  low-temperature  epitaxial  methods 
such  as  atomic-layer  epitaxy  (ALE)  and  molecular- 
beam  epitaxy  (MBE)  is  believed  to  take  place 
under  nonequilibrium  growth  conditions.  L2  How¬ 
ever,  when  the  surface  relaxation  rate  is  much 
larger  than  the  arrival  rate,  each  surface  atom  will 
have  enough  time  to  minimize  its  free  energy 


before  the  arrival  of  the  next  atom,  so  that  a  ther¬ 
modynamic  model  can  be  expected  to  qualitatively 
describe  the  morphology  of  the  growth  surface. 
Statistical  models  with  various  degrees  of  approxi¬ 
mations  are  frequently  used  to  smdy  the  nature  of 
the  surface.3-5  In  this  paper,  we  extend  the  studies 
to  consider  the  surface  arrangement  of  vacancies  in 
a  zinc -blende  lattice.  We  obtain  the  strength  of  a 
pair  interaction  on  various  ideal  semiconductor 
surfaces  using  the  tight-binding  Green's  function 
method.6-7  The  pair  interaction  energies  are  then 
used  in  a  thermodynamic  model  to  obtain  the 
minimized  free  energy.  The  qualitative  nature  of 
various  surfaces  is  discussed. 

2.  PAIR  INTERACTION  ENERGIES 

The  surface  sublimation  energies  for  various 
surface  orientations  are  obtained  using  the  ught- 
binding  Green's  function  (GF)  method.  Because 
the  details  of  the  calculational  procedure  have  been 
published  elsewhere,^*  this  discussion  will  be 
brief. 

We  define  surface  sublimation  energy  (SSE)  as 
the  difference  between  the  total  energy  of  a  surface 
plus  a  free  atom  and  that  of  the  surface  with  the 
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atom  bonded  to  it.  In  this  convention,  a  positive 
SSE  means  that  atoms  will  require  energy  to  go 
from  surface  to  vapor.  The  calculation  of  SSE  pro¬ 
ceeds  in  two  steps.  First,  the  GF  for  an  ideal  atom¬ 
ically  smooth  surface  is  calculated  in  terms  of  bulk 
GF.  using  Dyson's  expansion.  The  modified  GF 
with  an  atom  added  to  or  removed  from  the  surface 
is  then  calculated  in  terms  of  surface  GF.  The 
change  in  density  of  states  introduced  by  perturba¬ 
tions  is  used  to  obtain  the  electronic  energy  contri¬ 
bution  to  SSE. 

The  perturbation  to  the  surface  that  includes  ♦he 
removal  of  an  atom,  leaving  a  surface  vacancy,  is 
referred  to  as  surface  sublimation  from  a  concen¬ 
trated  surface  and  the  corresponding  SSE  is  deno¬ 
ted  as  Ec-  This  removal  entails  breaking  bonds 
witli  the  layer  below  and  in-plane  and,  sometimes, 
breaking  second-neighbor  surface  bonds.  The 
perturbation  to  the  surface  that  involves  the  re¬ 
moval  of  an  atom  and  leaves  an  ideal  flat  surface 
is  referred  to  as  sublimation  from  a  dilute  surface 
and  the  corresponding  SSE  is  denoted  as  Ed-  Here, 
the  removal  necessitates  breaking  bonds  with  the 
layer  below  only. 

In  this  model,  it  is  clear  that  the  effective  atom- 
atom  pair  interaction  energy,  eo<  between  atoms  on 
the  surface  is  simply  (Ec  -  EdV  Tjo,  where  T|o  is  the 
number  of  near-neighbor  surface  bonds.  A  posi¬ 
tive  (negative)  eo  implies  an  attractive  (repulsive) 
interaction  between  surface  atoms.  Calculated 
values  of  Ec  and  Ed  are  given  in  Table  I  for  (1 1 1), 
(iii)  and  (100)  surfaces  of  silicon,  GaAs,  CdTe 
and  HgTe.  Notice  that  the  (111)  surface  can 
terminate  either  with  triply  bonded  cations  or  with 
singly  bonded  anions.  The  opposite  bonding 
arrangement  is  found  in  the  [1  il ]  direction. 

Intuitively,  one  would  expect  Ec  >  Ed-  The 
removal  of  an  atom  from  the  concentrated  limit,  in 
addition  to  breaking  interlayer  bonds,  requires  the 
breaking  of  in-plane  surface  bonds,  which  are  first- 
neighbor  bonds  for  the  (110)  surface  and  second- 
neighbor  bonds  for  the  other  surfaces.  Even  in  the 
absence  of  first-neighbor  bonds  on  the  surface,  the 


surface  dangling  bo'ids  in  the  concentrated  limit 
interact  to  form  a  partially  filled  band,  which  low¬ 
ers  energy.  When  a  surface  atom  is  removed,  the 
removal  of  an  electron  from  these  broadened  sur¬ 
face  states  will  in  general  require  more  energy  than 
if  the  dangling  states  were  isolated,  as  in  the  dilute 
case.  As  seen  from  Table  1,  this  conventional 
interpretation  explains  the  trends  observed  in  sili¬ 
con,  HgTe,  and  in  GaAs  in  most  cases.  However, 
for  other  cases,  we  find  that  E-  is  smaller  than  Ed- 
In  order  to  understand  the  mechanisms  that 
drive  some  semiconductors  to  have  Ec  <  Ed>  we 
examine  the  nature  of  the  anion  and  cation  surface 
states.  As  an  illustration,  we  choose  the  (HI) 
surface  of  CdTe  where  Ec  is  less  than  Ed-  The 
surface  density  of  states  for  two  ideal  surfaces,  the 
Cd-  and  Te-terminated  (iii),  are  shown  in  Figure 
1.  For  an  unreconstructed  surface,  some  of  the 


FIGURE  1 

Density  of  dangl'mg  hybrid  states  of  Te  (dashed)  and 
Cd  (solid)  for  (tii)  oriented  CdTe.  Em  is  the  highest 
occupied  level  for  Cd  terminated  (iii)  surface. 

surface  states  normally  lie  in  the  fundamental  gap. 
Compounds  have  both  cation-  and  anion-derived 
hybrid  surface  states,  but  the  cation-derived  states 
most  often  lie  higher  in  energy.  The  energy 
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exception.  Owing  to  an  increased  ionicity  in  II-VI 
compounds,  the  energy  separation  between  the 
dangling  states  is  large.  As  illustrated  above,  the 
effect  of  the  charge  transfer  is  substantial  in  CdTe 
and  in  most  cases  causes  Ec  to  be  less  than  Ed- 
Although  HgTe  is  a  II-VI  compound,  the  charge 
transfer  does  not  occur  because  it  is  a  semimetal 
with  no  forbidden  gap.6.7 

From  Table  1,  we  conclude  that  Si,  HgTe,  and 
most  GaAs  surfaces  will  exhibit  smooth  to  rough 
transitions  while  most  of  the  CdTe  surfaces  and  the 
singly  bonded,  gallium-terminated  (111)  surface 
will  undergo  an  order-disorder  transition.  During 
growth,  the  actual  nature  of  a  surface  will  be  deter¬ 
mined  by  the  growth  temperature  with  respect  to 
the  critical  temperature. 

3.  RESULTS  AND  DISCUSSION 

In  ALE,  grown  from  a  free  atom  vapor  phase, 
only  one  type  of  atom  species  (anion  or  cation)  is 
present  at  any  given  time.  The  atom-atom  pair 
energies  calculated  in  Section  2  can  be  used  in  a 
single-layer  growth  model  to  obtain  minimized 
free  energy  as  a  function  of  layer  coverage,  x.  At 
temperatures  below  Tc,  the  growth  will  take  place 
with  the  formation  of  islands,  and  atomic  concen¬ 
tration  in  those  islands  will  be  given  by  the  location 
of  multiple  minima  in  the  curve  of  x  versus  free 
energy.  At  temperatures  well  below  Tc,  the 
minima  lie  near  x  =  0  and  jc  =  1.  In  these  cases, 
islands  on  the  surface  are  nearly  fully  occupied  and 
the  remaining  area  is  nearly  empty.  As  the  surface 
coverage  is  increased,  the  islands  grow  in  size  with 
a  fixed  vacancy  concentration  until  the  layer  is 
fully  grown,  i.e.,  two-dimensional,  layer-by-layer 
growth. 

Because  the  (100)  surface  is  a  square  lattice,  the 
exact  value  of  Tc  =  0.57  eo/ke  given  by  Onsagar^ 
is  used,  where  ke  is  Boltzmann’s  constant.  For  the 
hexagonal  lattice,  observed  in  the  (111)  orien- 
'ation,  the  exact  value  U  of  Tc  is  0.9 1  eo  /  ke.  The 
calculated  Tc  are  given  in  Table  2  for  Si,  HgTe, 
and  most  GaAs  surfaces.  Tc  is  much  larger  than 


the  typical  growth  temperature  for  these  elements 
and  compounds  [except  for  the  (Hi)  Te- 
terminated  HgTe  surface].  Thus,  ALE  growth  of 
these  materials  is  dominated  by  the  formation  of 
nearly  perfect  islands. 

When  the  interaction  between  surface  atoms  is 
repulsive,  as  in  a  Ga-terminated  (Hi)  GaAs 
surface  and  all  CdTe  surfaces,  the  free  energy  of  a 
homogeneous  surface  always  remains  negative 
with  one  minimum.  However,  these  surface 
energies  may  reduce  the  free  energy  with  ordered 
arrangement  of  atoms  and  vacancies.  The  nature 
of  the  superlattice  arrangements  below  7’J  and  the 
area  of  different  superlattices  depend  on  the  growth 
temperature  and  surface  coverage.  For  example,  at 
temperatures  well  below  7’?  for  a  square  lattice  of 
50%  coverage,  the  superlattice  pattern  will  look 
like  a  checkerboard,  where  every  near-neighbor 
site  of  an  atom  (vacancy)  is  occupied  by  vacancies 
(atoms).  This  type  of  arrangement  can  be 
envisioned  for  a  monolayer  growth  on  (100) 
orientations.  T?  calculated  using  Onsagar's  result 
for  (100)  surfaces  and  that  using  an  exact  value  12 
of  0.35  eo/ke  for  (111)  orientations  are  given  in 
Table  2.  We  see  that  the  calculated  temperatures 
are  well  above  the  typical  ALE  or  MBE  growth 
temperatures  for  these  compounds  [except  for  the 
(lil)  Ga-terminated  GaAs  surface].  Hence,  the 


TABLE  2.  Transition  temperatures  in  Kelvins. 
Parentheses  show  number  of  bonds  broken  with 
layer  below. _ 


Orien¬ 

tation 

Re 

moved 

atoms 

Si 

TcK 

GaAs 

TcK 

CdTe 

TcK 

HgTe 

TcK 

(111) 

a(l) 

3700 

2600 

500 

3500 

c(3) 

3900 

200 

*1400 

2400 

(TiT) 

a(3) 

3900 

700 

*2100 

0 

c(l) 

3700 

410 

*600 

900 

(100) 

a{2) 

3200 

2200 

*4000 

1200 

c(.21 

3200 

170C 

*3300 

2200 

•Order-disorder  transition  temperature 
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partly  filled  growth  surface  is  expected  to  exhibit 
superlattice  arrangements  with  atoms  and 
vacancies. 

In  MBE  where  both  species  are  present,  simul¬ 
taneous  growth  of  an  anion-cation  double  layer  can 
be  expected.  We  extend  our  model  to  study  the 
double-layer  growth  of  semiconductors  with  zinc- 
blende  lattice.  The  intralayer  pair  interaction  ener¬ 
gies  deduced  for  the  single-layer  growth  model  are 
still  valid.  The  interlayer  anion-cation  pair  inter¬ 
action  energies  are  simply  Edhw,  where  rii  is  the 
number  of  interlayer  near-neighbor  sites.  We 
define  an  anion-stabilized  growth  to  be  the  case 
where  relative  anion  and  cation  fluxes  are  such  that 
any  exposed  surface  atoms  are  anions.  When  the 
interaction  between  the  atoms  in  the  upper  layer 
and  that  in  the  lower  layer  is  strong  (compared 
with  kTg),  the  surfaces  always  grow  in  the  smooth 
limit.  (100)  surfaces  of  HgTe  and  GaAs  and  all 
surfaces  of  Si  are  predicted  to  fall  into  this  categ¬ 
ory.  However,  when  the  interaction  between  adja¬ 
cent  layers  is  not  always  strongly  attractive,  as  in 
the  case  of  (Hi)  HgTe  and  (111)  GaAs  surfaces, 
the  nature  of  growth  depends  on  whether  the 
surface  is  anion  or  cation  stabilized. 

The  free  energy  calculated  in  quasi-chemical 
approximation  (QCA)13  at  a  growth  temperature  of 
i85'C  for  Te-  and  Hg-stabilized  (lii)  surfaces  is 
plotted  respectively  in  Figure  2(a)  and  (b)  as  a 
function  of  Te  and  Hg  surface  concentration.  Each 
comer  of  this  plot  represents  the  surface  fully 
covered  by  the  element  noted  there.  In  ALE, 
growth  takes  place  along  the  sides.  Whenever  the 
minimum  occurs  between  the  two  comers,  that 
surface  is  predicted  to  grow  in  the  rough  limit 
where  atoms  and  vacancies  randomly  occupy  the 
surface  sites.  Whenever  the  minima  occur  near  the 
comers,  the  surface  is  predicted  to  grow  in  the 
smooth  limit  with  formation  of  nearly  perfect 
islands.  In  MBE  where  both  species  are  present  in 
double-layer  stochastic  growth  of  compounds,  the 
above  arguments  are  applied  to  the  free-energy 
curve  along  the  diagonal  line  connecting  the  origin 


and  upper-right  comer.  Accordingly,  we  see  from 
Figure  2  that,  while  ALE  growth  of  the  Hg  and  Te 


(■) 


Al 


FIGURE  2 

(a)  Free  energy  for  Te-stabilized  surface  (i f  i) 
surface  of  HgTe  at  ISS'C  temperature  as  a 
function  of  Hg  and  Te  surface  concentrations. 

(b)  Free  energy  for  Hg-stabilized  surface  (iii) 
surface  of  HgTe  at  ISS'C  temperature  as  a 
function  of  Hg  and  Te  surface  concentrations. 

layer  takes  place  in  the  smooth  and  rough  limits, 
respectively,  MBE  growth  of  the  Te-stabilized 
surface  takes  place  in  the  smooth  limit.  This  is 
because  the  pair  energy  for  Te  (upper  layer)-Hg 
(lower  layer)  is  strong,  0.9  eV.  However  the  Hg- 
stabilized  surface  grows  in  the  rough  limit  because 
the  Hg  in  the  upper  layer  is  weakly  attracted  to  Te 
below  (nearly  0  eV  )  and  atoms  do  not  congregate 
into  islands  to  maximize  the  cadon-anion  pairs. 
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4.  CONCLUSIONS 

We  have  calculated  intralayer  and  interlayer 
atom-atom  pair  interaction  energies  using  a  tight- 
binding  Green’s  function  method.  The  calculated 
values  imply  that  sublimation  energies  do  not 
increase  linearly  with  number  of  broken  bonds. 
Also,  the  sublimation  energy  depends  on  the  cov¬ 
erage.  The  possibility  of  repulsive  surface  interac¬ 
tions  due  to  a  charge  transfer  mechanism  is  predic¬ 
ted.  These  energies  are  then  used  in  a  thermodyna¬ 
mic  model  to  study  the  nature  of  growing  surfaces. 
As  the  growth  temperature  is  varied,  attractive 
interaction  between  surface  atoms  leads  to  smooth- 
to-rough  transition,  and  Si,  HgTe,  and  most  GaAs 
surfaces  are  calculated  to  fall  in  this  category.  The 
repulsive  interactions  lead  to  superlattice  ordered- 
to-disordered  transition,  and  (III)  GaAs  and  most 
CdTe  surfaces  are  expected  to  be  in  this  category. 
Smooth  or  rough  surface  growth  is  found  to  be 
decided  by  in-plane  interactions  in  an  ALE  growth 
and  is  also  influenced  by  interplane  interactions  in 
double-layer  MBE  growth.  In  our  calculations  of 
pair  energies,  we  considered  only  ideal  surfaces. 
The  critical  temperatures  and  other  conclusions  in 
this  paper,  can,  at  best,  be  expected  to  represent  the 
trends.  Nonidealities  such  as  reconstruction, 
dimerization,  relaxation  of  adatoms  and  of  surface 
can  lower  surface  energy  substantially.! ‘^•15  For 
quantitatively  more  accurate  values,  the  effects  of 
these  mechanisms  on  the  values  of  (£c  -  £d)  and 
statistical  approximations  that  are  better  than  QCA 
with  pairs  will  have  to  be  included. 
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ABSTRACT 

We  have  examined  the  prospects  of  Ini.xTlxSb  as  a  long- wave  infrared  (LWIR) 
material  for  focal-plane  array  applications.  We  find  there  is  a  near  bond-length  match 
between  InSb  and  TlSb  in  the  zinc  blende  phase,  the  cohesive  energy  is  typical  of  InSb, 
and  the  concentration  x  needed  to  narrow  the  gap  into  the  LWIR  range  is  about  10  ±  5%. 
The  calculated  phase  diagram  shows  a  wide  temperature  range  where  this  alloy  is  itable. 
While  this  diagram  shows  that  the  alloy  cannot  ^  grown  from  the  liquid  phase,  growth 
should  be  possible  in  a  vapor-phase  process. 


We  have  investigated  the  prospects  of  Ini-xTl^Sb  as  an  IR  detector  material  with  a  gap  that  can 
be  adjusted  to  zero  just  as  that  of  Hgi-xCdxTe.  In  the  search  for  a  ni-V  compound -based  alternative  to 
Hgi-xCdxTe  for  long- wave  infrared  (LWIR)  focal-plane  arrays, ^>2  a  number  of  materials  are  currently 
under  investigation,  including  InAsi-xSbx  based  layer  superlatdces^  and  Ali.xGaxAs  based  quantum  well 
structures.  2  InSb  is  often  used  in  mid-wave  irvfrared  (MWIR)  devices,  but  its  bandgap  is  too  wide  to 
respond  in  the  LWIR.  The  alloy  InSbi-xBix  has  been  examined^’'*  as  one  with  an  adjustable  narrower  gap 
that  might  serve  the  purpose.  However,  BiSb  does  not  form  in  the  zinc  blende  (ZB)  structure^  and  the 


1  G.C.  Osbourn,  L.R.  Dawson,  R.M.  Biefeld,  T£.  Zipperian,  IJ.  Fritz,  and  B.L.  Doyle,  /.  Vac.  Sci.  Technol.  AS, 
3150(1987). 

^  B  J.  Levine,  C.G.  Bethea,  K.G.  Glogovsky,  J.W.  Stayi,  and  R£.  Leibenguth,  Semicond.  Sci.  Technol.  6,  Cl  1 
(1991). 

3  J.  Zclko  and  J.E.  Greene,  Appl.  Phys.  Lett.  33, 254  (1978);  J.  Appl.  Phys.  51, 1549  (1980). 

^  M.A.  Berding,  A.  Sher,  A.-B.  Chen,  and  W£.  Miller,/.  Appl. Phys.  63, 107  (1988). 

3  O.  Made'-jng,  Physics  of  III-V  Compounds,  John  Wiley  and  Sons,  New  York,  Ch.  1  (1964). 
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alloy  is  not  miscible  in  equilibrium.  It  has  been  prepared  by  Ag  ion-assisted  epitaxy,  but  no  practical 
growth  system  has  evolved  from  this  work. 

We  have  theoretically  examined  Ini.xTlxSb  based  on  a  combination  of  full-potential  linearized- 
muffin-tin  orbital  approximation  (FP-LMTO)  whose  accuracy  is  limited  only  by  the  local  density 
approximation  (LDA)^  and  a  parametrized  calculation  of  the  Ini-xTlxSb  liquid-state  free  energy.  Details 
of  these  calculations  will  be  presented  elsewhere.  Here  we  summarize  only  the  results  relevant  to  the  IR 
community. 

Table  1  contains  the  atomic  volume  Q.  cohesive  energy  per  cation  E,  bulk  modulus  B,  one  shear 
coefficient  Ci  1-C12.  and  the  bandgap  Eg  for  the  materials  HgTe.  CdTe,  InSb,  and  TlSb.  For  InSb  and 
TlSb  the  calculations  are  presented  for  both  the  ZB  and  CsG  crystal  strucmres.  Other  structures  were 
also  examined,  but  their  cohesive  energies  are  small.  The  HgTe  and  CdTe  ZB  numbers  were  done  for 
comparison. 

Table  1 

Calculated  and  experimental  optical  and  staictural  properties  for  CdTe,  HgTe,  InSb,  and  TlSb  in  a  AB 
lattice  at  0  K.  Atomic  volume  Q  is  in  A,  cohesive  energy  per  cation  E  and  bandgap  Eg  are  in  eV;  C 1  i-C  12 
and  bulk  modulus  B  are  in  10^2  erg/cm3.  Calculated  values  for  InSb  in  the  CsCI  and  TlSb  in  both  ZB  and 
CsCI  lattices  are  shown.  Column  Adj  adjusts  the  calculated  values  of  TlSb  in  the  ZB  structure  for  errors  in 
the  LDA. 


CdTe 

HgTe 

InSb 

TlSb 

ZB 

Calc 

Expt 

Calc 

Expt 

Calc 

Expt 

CsCI 

ZB 

Adj 

CsCI 

33.40 

34.2 

33.4 

34.1 

33.8 

25.6 

36.4 

27.1 

E 

5.28 

4.40 

3.24 

6.45 

5.60 

6.08 

5.70 

5.79 

C11-C12 

0.20 

0.17 

0.19 

0.18 

0.31 

0.31 

0.17 

B 

0.44 

0.42 

0.44 

0.47 

0.45 

0.48 

0.72 

0.38 

0.49 

Eg 

0.54 

1.60 

-0.95 

-0.30 

-0.46 

0.26 

-2.20 

-1.5 

A  number  of  important  points  can  be  deduced  from  Table  1 . 

•  For  HgTe,  CdTe.  and  InSb  where  experimental  numbers  are  available,^  the  calculated 
structural  properties  Q,  B,  and  C11-C12  agree  well  vrith  experiment.  E  values  are  generally 
slightly  high,  because  the  free-atomic  energies  are  not  done  weU  in  LDA.  However, 
differences  between  solid  energies  are  reliable.  The  bandgaps  are  always  too  small  in  LDA, 
but  the  reasons  are  well  understood  and,  once  again,  reliable  adjustments  can  be  made. 

•  In  agreement  with  experiment,^  we  find  the  TlSb  ground  state  is  the  CsQ  structure,  but  the 
energy  difference  with  the  ZB  structure  is  small — 88  meV.  The  energy  difference  for  InSb 
favors  the  ZB  structure  by  380  meV. 


^  M.  Methfessel  and  M.  van  Schilfgaarde,  unpublished. 

^  K.-H.  Hellwege,  editor-in-chief  Landholi-Bornstein:  Numerical  Data  and  Functional  Relationships  in  Science 

and  Technology.  Vols.  17  &  22,  Springer-Verlag,  Berlin,  Heidelberg,  New  York  (1982). 
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•  For  the  ZB  structure,  the  atomic  volumes  of  InSb  and  TlSb  are  small,  corresponding  to  a  bond 
length  difference  of  2.1%.  Thus,  they  are  matched  nearly  as  well  as  Hgi-xCdxTe. 

•  The  adjusted  zero-temperature  bandgaps  of  the  InSb  (0.26  eV)  and  TlSb  (-1.4  cV)  are  such 
that  the  concentration  needed  for  a  0. 1  e V  gap  is  x  =  0.09.  At  77  K,  we  estimate  x  <  0. 15 
will  bring  the  gap  into  the  LWIR  range.  These  numbers  are  the  most  uncertain  of  those 
presented. 

•  Th?  predicted  cohesive  energy  per  cation  of  the  Ino.9Tlo.1Sb  is  6.38  eV,  compared  to  5.05  eV 
for  Hgo.78Cdo22Te:  it  is  26%  more  strongly  bound. 

•  Even  assuming  no  bandgap  bowing,  dEg/dx  is  comparable  for  InTlSb  and  HgCdTe  alloys. 

Thus,  the  Ini.xTlxSb  alloy  looks  like  a  viable  LWIR  material  if  it  can  be  prepared  in  the  ZB  structure 
with  high  enough  concentration.  We  address  that  issue  next. 

Our  calculated  phase  diagram  of  Ini-xTlxSb  is  shown  in  Figure  1.  The  lower  part  of  the  diagram, 
that  below  the  solidus  curves,  is  based  on  our  FP-LMTO  results  and  is  likely  to  be  accurate.  The  solidus 
and  liquidus  curves  are  based  on  some  liquidus  properties  that  have  been  projected  from  the  measured 
properties  of  InSb.  and  a  theoretical  estimate  of  the  melting  point  of  TlSb  in  the  CsQ  structure.  The 
melting  px3int  of  InSb  is  808  K.^  the  calculated  eutectic  temperature  and  composition  are  687  K  and  0.68, 
the  point  at  which  the  ZB  miscibility  curve  intersects  its  solidus  is  at  700  K  and  0. 16,  the  point  at  which 
the  CsCl  strikes  its  solidus  is  7(X)  K  and  0.99,  and  the  melting  point  of  the  TlSb  in  the  CsQ  structure  is 
747  K.  It  is  evident  from  this  phase  diagram  that  growing  the  proper  alloy  from  the  melt  is  impossible, 
which  is  in  agreement  with  reported  experiments.**^ 

There  is,  however,  a  substantia]  temperature  range  where  the  proper  alloy  composition  is  stable 
and  it  should  be  possible  to  grow  the  ti.aterial  either  by  molecular  beam  epitaxy  (MBE)  or  metal  organic 
chemical  vapor  deposition  (MOCVD)  techniques.  One  attempt  to  grow  the  material  by  MBE  failed,'®  but 
that  was  done  without  the  benefit  of  the  information  in  Figure  1. 

Finally,  we  conclude  that  the  Ini.xTlxSb  alloy  should  be  stable  in  the  concentration  range  needed 
for  LWIR  focal-plane  arrays,  and  because  the  proper  composition  is  mostly  InSb,  it  should  process  in 
much  the  same  way.  However,  this  remains  to  be  tested  once  the  material  is  prepared. 
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TEMPERATURE  (K) 


Figure  1 .  Phase  diagram  of  Ini.xTIxSb.  For  x  «  1  the  zinc  bfende  structure  is  stable,  and  for  x  »  i  the 
CsCI  (cc)  structure  is  the  more  stable  one.  The  eutectic  point  is  x  =  0.675  and  T  *  687  K. 
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A -B.  CHEN,  ARDEN  SHER  AND  W.T  YOST 


I.  Introduction 


1.  Goals 

The  goals  of  this  paper  are  to  review  the  current  state  of  knowledge  of  the 
elastic  constants  of  elemental,  compound,  and  pseudobinary  alloy  semicon¬ 
ductors.  To  accomplish  this  objective,  we  will  review 

•experimental  methods  currently  used  to  measure  elastic  constants, 
•experimental  results, 

•binding  and  elastic  constant  theory,  and 
•related  mechanical  properties. 

Heavy  emphasis  is  placed  on  comparisons  between  theory  and  experiment, 
and  the  accuracy  of  approximations  currently  in  vogue.  The  theories  dis¬ 
cussed  range  from  first-principles  methods,  requiring  heavy  comutations,  to 
parametrized  physical  models.  The  intent  is  to  identify  a  logical  path  between 
these  extremes,  and  thereby  provide  insight  into  the  connection  between 
atom  potentials  and  semiconductor  mechanical  properties.  In  the  course  of 
this  presentation,  there  are  a  number  of  instances  where  improvements  in  the 
theory  or  additional  experimental  results,  are  needed.  We  have  tried  to 
highlight  these  situations  and  suggest  possible  remedies. 


2.  Definition  and  Calculation  of  Elastic  Constants 

The  theory  of  elasticity  of  solids  has  been  well  formulated  in  many  treatises 
(Love,  1944;  Landau  and  Lifshitz,  1959),  so  we  need  not  review  the  different 
formalisms  and  conventions.  However,  as  we  do  wish  to  present  a  coherent 
account  of  the  essence  of  the  theory,  it  is  necessary  to  define  terms  and 
describe  general  calculations  to  be  used  later. 

In  linear  elasticity  theory,  deformation  is  assumed  to  be  infinitesimal.  The 
relative  displacement  vector  x'  between  two  points  in  a  deformed  solid  is 
related  to  the  corresponding  vector  x  in  the  undeformed  solid  by  the 
following  equation,  in  component  form,  through  a  nine-component  strain 
sensor  e: 


x;  =  x„-l- Z^E.pX^.  (1) 

The  change  in  the  internal  energy  associated  with  £  is  also  small  and  will  be 
denoted  UV,  where  V  is  the  equilibrium  volume  of  a  solid.  Under  the 
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1 .  ELASTIC  CONSTANTS  AND  THEIR  ALLOYS 


3 


condition  that  the  entropy  and  the  electrostatic  displacement  field  are 
constant,  U  is  only  a  function  of  £,  and  is  quadratic  in  s: 


(2) 


where  are  the  elastic  stiffness  coefficients,  which  are  characteristic 
properties  of  the  solid.  From  this  definition,  is  required  for  U  to 

be  an  analytical  function  of  e.  Moreover,  is  also 

required  to  ensure  that  U  is  zero  under  any  infinitesimal  rigid  rotation.  In 
light  of  these  properties,  the  energy  density  can  be  expressed  in  terms  of  a 
symmetrical  strain  tensor  rj  as 


U  Itv  * 


(3) 


where  is  defined  as 


riati  =  ¥.^.0  +  W 

The  strain  tensor  rj  is  a.  thermodynamic  parameter  with  stress  tensor  a  as  its 
conjugate  variable  (Brugger,  1964).  The  components  of  a  are  given  by 

_SU_ 

It  is  clear  that  c  is  also  a  symmetrical  tensor. 

The  most  frequently  used  notation  is  the  engineering  convention,  in  which 
the  strain  tensor  e  is  related  to  ri  of  Eq.  (4)  by  =  rj„  for  the  diagonal 
components,  but  =  2^/,^  for  a  ^  p.  Furthermore,  since  e  has  at  most  six 
independent  components,  it  is  treated  as  a  six-component  vector,  with  the 
vector  components  1  to  6  corresponding  respectively  to  the  tensor  compo¬ 
nents  XX,  yy,  zz,  yz,  xz  and  xy.  In  terms  of  e  the  strain  energy  density  is  written 
as 


U  =  ^Z,jC,je,ej,  (6) 

where  C,j  can  be  identified  as  with  i  =  a)5  and  j  =  fiv.  Because  C,j  is 
symmetrical,  it  has  at  most  21  independent  components  for  any  crystal. 
Crystal  symmetries  reduce  this  number  further  (Love,  1944;  Ashcroft  and 
Mermin,  1976).  For  a  cubic  lattice,  to  which  class  the  zincblende  and 
diamond  semiconductors  belong,  there  are  only  three  independent  compo¬ 
nents,  namely  C^,  C12,  and  C44. 
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The  three  independent  elastic  constants  of  the  diamond  and  zincblende 
(zb)  semiconductors  can  be  calculated  by  considering  the  following  three 
strains. 

(i)  Under  a  uniform  expansion,  which  changes  a  displacement  x  into 
x'  =  (1  +  e)x,  then  =  62  =  =  e  and  the  other  strain  components  are 

zero;  the  elastic  energy  density  is  then  given  by 

U  =  3(Cu+2C,2)eV2.  (7) 

U  can  also  be  expressed  in  terms  of  the  adiabatic  bulk  modulus  B  defined  by 
dP  =  ~B(6V/V),  where  V  is  the  crystal  volume,  is  its  change,  and  SP  is 
the  corresponding  pressure  change.  The  result  is  U  =  B(<5K/K)*/2  =  9Be^/2, 
because  the  dilatation  is  SV/V  =  3e  in  the  present  case.  This  establishes  the 
relationship 


B  =  (C,y+2C,2)P.  (8) 

(ii)  The  next  case  to  consider  is  a  tetragonal  shear  strain  e  which  changes  a 
displacement  according  to 

(x,  y,  z)  (x  +  ex,  y  -  ey,  z) .  (9) 

The  only  nonzero  strain  components  are  Cj  =  —  Cj  =  e.  Then  U  becomes 

U  =  (C^,-C,2)e^.  (10) 

(iii)  To  calculate  C44  we  consider  a  shear  strain  e  that  changes  a 
displacement  according  to 

(x,  y,  2)  -» (x  +  ey/2,  y  +  ex/2, 2) .  (11) 

This  strain  contains  e^  as  the  only  nonzero  component.  Because  C44  equals 
Cgg,  the  energy  density  is  simply 


U  =  C44eV2.  (12) 

Although  the  macroscopic  crystal  distortion  of  the  Bravais  lattice  caused 
by  this  strain  is  described  by  Eq.  (11),  microscopically  there  is  a  relative 
displacement  u  =  (0, 0, «),  the  so-called  Kleiman  (1962)  internal  displace¬ 
ment,  between  two  successive  atomic  planes  perpendicular  to  the  2  axis.  In 
other  words,  the  relative  displacements  between  the  atoms  on  the  same  fee 
sublattice  are  governed  by  Eq.  (11),  but  there  is  an  additional  induced  relative 
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Fig.  1.  Distortion  of  a  tetrahedron  corresponding  to  the  C44  elastic  constant;  u  is  the  internal 
dispacement  between  the  anion  and  cation  sublattices. 

displacement  u  between  the  two  sublattices.  The  directions  of  the  displace¬ 
ments  of  atoms  in  a  tetrahedral  cell  are  shown  in  Fig.  1.  In  calculations  one 
can  use  an  arbitrary  infinitesimal  pair  of  e  and  u  to  obtain  the  coefficients  in 
the  following  quadratic  expansion  of  the  strain  energy  density: 

U  =  <Du  V2  +  Deu  +  C^y/2 ,  (13) 

where  the  force  function  <!>  is  related  to  the  transverse  optical  (TO)  phonon 
frequency  co  at  T  (the  center  of  Brillouin  zone)  by  <I>  =  with  n  being  the 
reduced  mass.  In  Eq.  (13),  D  is  a  constant,  and  021  would  be  the  shear 
stiffness  coefficient  if  the  internal  displacement  were  not  allowed.  Kleiman 
(1962)  defined  an  internal  displacement  parameter  that  is  related  to  the 
equilibrium  value  of  u  by  u  =  l^ae/A  for  a  fixed  e,  where  a  is  the  lattice 
constant.  Taking  the  first  derivative  of  U  with  respect  to  u  in  Eq.  (13)  and 
setting  it  equal  to  zero,  one  finds  that  the  C  value  is  given  by 

(;=  _4£)/(a<I»).  (14) 

Finally,  the  sought-after  C44  is  given  by 

C44  =  Cffl-CV<I>/16.  (15) 
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Thus,  the  internal  displacement  is  an  essential  part  of  C44.  These  procedures 
will  be  used  in  the  theoretical  calculations  to  be  presented  in  subsequent 
sections. 


3.  Elastic  Constants  and  Sonic  Wave  Propagation 

The  purpose  of  this  section  is  to  give  the  reader  sufficient  information  to 
measure  ultrasonic  wave  velocities  and  to  deduce  from  them  the  elastic 
coefficients  of  materials  of  interest  to  the  electronics  industry.  We  start  from 
the  definition  of  strain  and  develop  equations  that  relate  the  elastic  coeffi¬ 
cients  with  wave  propagation  velocities  in  various  crystallographic  direc¬ 
tions.  The  cases  taken  as  examples,  cubic  and  isotopic,  were  chosen  because 
of  their  prevalent  role  in  electronic  materials. 

A  number  of  techniques  used  to  measure  wave  propagation  velocities  in 
materials  are  discussed.  The  researcher’s  decision  on  technique  depends  upon 
a  number  of  factors,  such  as  accuracy  needed,  size  and  crystallographic 
orientation  of  available  samples,  equipment,  etc.  Sample  preparation  is 
significant,  since  a  carelessly  prepared  sample  yields  useless  velocity  data. 
Because  the  equipment  needs  vary  widely  among  the  techniques  that  can  be 
used,  references  containing  information  on  each  technique  are  given.  Key 
papers  chosen  for  citation  were  picked  for  their  readability  and  their  direct 
application  to  the  measurement  technique.  Primary  emphasis  has  been  given 
to  techniques  that  are  generally  accepted  and  that  give  results  that  are 
understood.  Throughout  this  section  a  general  theory  of  elasticity  due  to 
Murnaghan  (1951)  is  used. 


a.  Elastic-to-  Wave  Relationship 


Consider,  as  before,  a  point  in  a  lossless  elastic  medium  at  rest,  whose 
displacement  vector,  x  =  (x^),  a  =  1, 2, 3,  are  the  cartesian  coordinates  of  a 
point  in  the  medium.  In  the  presence  of  a  stress  wave  disturbance,  the  point 
moves  to  a  new  location,  x'  =  (x^  at  time  t.  The  Lagrangian  strains,  r\, 
defined  consistently  with  Eqs.  (1)  and  (4)  but  with  higher-order  terms 
retained  (Landau  and  Lifshitz,  1986;  Murnaghan,  1951),  are  as  follows: 


2  V^x^  ax,  "7 


(16) 


where  5^,  is  the  Kronecker  delta.  Einstein  summation  convention  over 
repeated  indices  is  used  in  this  section. 
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By  considering  the  internal  energy  per  unit  mass  of  the  medium 
£(x,,  rj^f,  S),  where  S  is  the  entropy,  we  obtain  a  relationship  between  wave 
propagation  and  the  internal  energy  of  the  medium.  If  we  restrict  this 
discussion  to  a  medium  that  is  initially  unstressed,  then  the  internal  energy 
per  unit  volume  of  the  medium  is  PoE(r^,  S),  where  Po  is  the  mass  density 
of  the  medium  in  the  unstrained  state.  By  expanding  this  in  a  power  series  of 
the  Lagrangian  strains,  we  obtain 


7a/)'  Po^i^a'  5)  +  ^ ^M»a/)yi7^v7a^7y4  "i" 

(17) 

The  coefficients  C^va^  ^tid  the  second-order  and  third-order  elastic 

coefficients  (adiabatic)  as  defined  by  Brugger  (1964); 


C 


livafi  Po 


d^E  \ 


(18) 

(19) 


The  wave  equation  can  be  derived  (Goldstein,  1965,  p.  347)  from  Lagrange’s 
equation 


d  /  dL  \  8L 
dt  dx^  ^  j  ax; 

where  x^  =  dxjdt  and  L  is  given  by 

^  -  Po£(Xa,  rjafiy  S) . 

Defining  the  particle  displacement  vector,  u,  as 

«^(^  t)  =  xj.  —  x^ , 


and  differentiating  it  gives 


du^  _  axj. 
ax,  “  ax, 


(20) 


(21) 


(22) 


(23) 
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Combining  Eqs.  (22),  (23),  and  (16)  gives  the  strain  tensor  in  terms  of  the 
particle  displacements. 


2  cx^  cx^  (9Xv 


(24) 


Using  Eqs.  (21)  to  (24)  in  Eq.  (20),  and  retaining  first-order  terms  in  dujdx^, 
gives 


PqUu  =  t-  ^ 


cu. 


We  assume  a  plane  wave  solution  to  Eq.  (25)  of  the  form 

•  ii  cos(/c„x,  -  cot) , 


(25) 


(26) 


where 


=  unit  displacement  vector  in  the  pih  direction, 
u  =  («^)  =  particle  displacement  vector, 
k  =  (/c  J  =  wave  propagation  vector,  and 
CO  =  angular  frequency. 

Substituting  Eq.  (26)  into  Eq.  (25),  we  obtain  the  eigenvalue-eigenvector 
equation 


=  0, 


(27) 


where 

(k^)  =  the  set  of  direction  cosines  of  k, 

V  =  o}f\k\  is  the  phase  speed,  and 

K  =  k/lk|  indicates  the  direction  of  wave  propagation  in  the  medium. 

For  any  given  wave  propagation  direction,  there  are  three  eigenvalue 
solutions  to  Eq.  (27);  these  correspond  to  one  quasilongitudinal  and  two 
quasitransverse  polarization  modes.  Relationships  have  also  been  derived  for 
wave  propagation  in  the  presence  of  residual  stresses  and  for  retention  of  the 
higher-order  elastic  coefficients  (Cantrell,  1982;  Breazeale  and  Ford,  1965). 

Because  both  the  stress  and  strain  tensors  are  symmetric,  one  can  reduce 
the  number  of  independent  second-order  elastic  coefficients  to  21  (Landau 
and  Lifshitz,  1986,  p.  32).  It  is  also  convenient  to  use  the  Voight  (1928) 
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contraction  of  the  indices  as  in  Eq.  (6),  which  is  used  in  the  remainder  of  this 
section.  The  number  of  independent  elastic  coefficients  varies  with  crystal 
structure*  and  is  listed  in  the  accompanying  table. 


Crystal  Class 

Number  of  Independent 
Elastic  Coefficients 

Triclinic 

21 

Monoclinic 

13 

Orthorhombic 

9 

Tetragonal  (C*,  S4,  C^h) 

7 

Tetragonal  (C4,.  D^,  D4J 

6 

Rhombohedral  (C3,  S^) 

7 

Rhombohedral  (Cj,,  Dj,  D34) 

6 

Hexagonal 

5 

Cubic 

3 

(Isotropic) 

2 

Green  (1973)  and  others  have  solved  Eq.  (27)  for  isotropic  materials  and 
for  cubic  crystals  in  the  [1(X)],  [110],  and  [111]  directions.  Using  the 
symmetry  properties  of  the  crystals,  one  obtains  the  results  presented  in 
Section  I.3.b  for  cubic  crystals,  and  in  Section  I.3.c  for  isotropic  solids. 


b.  Cubic  Crystals 

For  plane  waves  propagating  along  the  [100]  direction: 

(jci  =  1,K2  =0,  Kj  =  0); 

longitudinal  (compressional)  waves  (pure  mode  with  particle  displacements 
Uj  in  direction  of  propagation,  Uj  and  —  0)  have  a  phase  speed 


Ul  = 


(28) 


and  transverse  (shear)  waves  (pure  mode  with  particle  displacements  Ui  =  0, 
U2  and  U3  perpendicular  to  the  direction  of  propagation)  have  values  given  by 


'We  assume  that  the  angles  defining  the  orientation  of  axes  in  the  crystal  are  not  specified.  For 
a  further  discussion,  see  Landau  and  Lifshitz  (1986). 
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For  plane  waves  propagating  along  the  [110]  direction: 

(ki  =  Xjyjl,  K2  =  l/v/2,  »C3  =  0); 

for  longitudinal  (compressional)  waves  (pure  mode  with  particle  displace¬ 
ments  Ui  =  Uj,  U3  =  0),  one  obtains 


= 


-h  C12  +  2C, 
2po 


(30) 


and  for  transverse  (shear)  waves  (pure  mode  with  Uj  —  —  U2,  U3  —  0),  one 
obtains 


V2  = 


(31) 


or  for  transverse  (shear)  waves  (pure  mode  with  Uj  =  U2  =  0,  U3  #  0),  one 
obtains 


*^3  = 


(32) 


For  plane  waves  propagating  along  the  [111]  direction: 


(ki  =  l/y/3,  K2  =  1/^/3,  K3  =  (33) 


the  speed  for  longitudinal  (compressional)  waves  (pure  mode  with  Ui 
U3)  is  given  by 


Vx  = 


/Cjj  +  2Cj2  44  . 


3po 


U2  = 


(34) 


and  that  for  transverse  (shear)  waves  (pure  mode  with  u^Ki  -f-  U2K2  +  — 

0  or  particle  displacement  perpendicular  to  wave  propagation)  is  given  by 


^2  =  U3  = 


+  C44. 


(35) 


c.  Isotropic  Solids 

For  the  case  of  isotropy,  all  directions  are  equivalent,  we  are  left  with  two 
independent  constants  (Lame  constants).  The  elastic  coefficients  can  be 
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expressed  in  terms  of  the  Lame’  constants  (Green,  1973)  as  follows: 

Cn  =  C22 

^12  ~  ^13 
^21  “  ^31 
^44  ~  ^S5 

For  plane  waves  propagating  in  any  direction  (e.g.,  the  x-direction); 

(Ki  =  I,  K2  =  0,  ^3  =  0). 

the  speed  for  longitudinal  (compressional)  wave  (pure  mode  with  particle 
displacements  Ui  in  direction  of  propagation,  U2  and  U3  =  0)  is  given  by 


C33  =  /  +  2/2, 
C23  =  A, 

^66  —  A*- 


(36) 


Vi  = 


/.  +  2/i 


Po 


(37) 


and  that  for  transverse  (shear)  waves  (pure  mode  with  particle  displacements 
Ui  =  0,  and  U2  and  U3  perpendicular  to  the  direction  of  propagation)  is  given 
by 


V2  =  1^3 


(38) 


II.  Measurement  Methods 

4.  Velocity  Measurements 

To  measure  the  second-order  elastic  coefficients  (SOEC),  one  can  deter¬ 
mine  the  sound  velocity  and  the  density  of  the  sample,  and  calculate  the 
combination  of  SOEC  that  the  direction  of  propagation  requires.  For  cubic 
systems,  sound  velocities  measured  in  the  pure  mode  directions  equivalent  to 
[1(X)],  [110],  and  [111]  make  it  possible  to  determine  the  three  indeiDcndent 
elastic  constants,  Cu,  Cjj,  and  C44.  Using  care  in  sample  preparation,  and 
appropriate  corrections  for  bond  thickness  and  diffraction,  the  sound  veloci¬ 
ties  can  be  determined  to  parts  in  10^.  The  accuracy  of  a  velocity  determina¬ 
tion  is  usually  limited  by  the  accuracy  of  the  path  length  measurement  in 
pulsed  and  continuous  wave  techniques.  With  optical  techniques,  deflection 
angles,  wavelength  measurements,  and  frequencies  are  determinant  factors. 

In  general,  there  is  little  problem  in  obtaining  a  sound  velocity  measure¬ 
ment  on  the  order  of  1%  uncertainty.  But  to  improve  on  this,  one  must 
exercise  additional  care  in  the  preparation  of  surfaces,  the  control  of 
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temperature,  the  determination  of  acoustic  path  length,  and  the  determina¬ 
tion  of  travel  time  of  the  acoustic  wave.^  For  single  transducer  configura¬ 
tions,  where  a  transducer  is  used  to  send  and  receive  the  acoustic  wave,  the 
base  equation  for  determining  the  velocity  using  pulsed  techniques  or  pulse 
echo  without  corrections  is  as  follows: 

2L 

v  =  ^  (39) 

where  v  is  the  wave  velocity,  L  is  the  sample  length,  and  T  is  the  round- 
trip  time.  Corrections^  are  made  in  the  evaluation  of  T  and  vary  with 
the  technique  used  in  the  measurement.  Corrections  for  bond  thickness 
(McSkimin,  1961),  and  for  diffraction  effects  (Papadakis,  1967)  can  be 
measured  and/or  calculated.  Accuracies  of  some  of  the  more  frequently  used 
pulsed  techniques  are  given  in  Table  I.  A  comparison  of  the  accuracies  of 
various  techniques  are  discussed  by  Papadakis  (1972,  1976).  Using  contin¬ 
uous  wave  techniques,  the  propagating  plane  wave  model  (Bolef  and  Miller, 
1971),  the  basic  equation  without  corrections  is 

V  =  2LAf,  (40) 

where  A/  is  the  frequency  difference  between  two  adjacent  mechanical 
resonance  modes.  Typically,  velocity  measurement  accuracy  is  good  to  about 
1%  to  10%.  A  correction  factor  (Chem  et  al,  1981)  can  be  applied  to  the  right 
side  of  Eq.  (39)  that  corrects  for  the  effects  of  the  transducer  and  bond  on  the 
mechanical  resonances  of  the  sample.  When  applied,  the  inaccuracy  can  be  as 
low  as  five  parts  per  1(X),(XX),  neglecting  inaccuracies  in  path  length  measure¬ 
ments. 

If  the  sample  is  transparent,  optical  techniques  offer  a  convenient  method 
to  measure  sound  velocity.  For  some  of  these  techniques,  the  accuracy  is  on 
order  of  parts  per  thousand.  Still,  some  offer  accuracies  of  parts  per  ten 
thousand  and  can  be  used  with  small  samples,  and  very  high-frequency 
ultrasonic  waves  (Breazeale  et  al,  1981).  In  general,  the  calculation  for 


^The  accuracy  depends  upon  the  correct  choice  for  the  resonant  frequency  of  the  transducer, 
which  can  be  obtained  from  the  fact  that  it  is  half  a  wavelength  in  thickness.  Using  a  micrometer, 
and  the  wave  velocity  of  the  transducer  material,  one  can  calculate  the  resonant  frequency  of  the 
transducer.  Transducer  off-resonant  conditions  can  have  a  relatively  large  influence  on  the 
measurement  of  round-trip  time  for  pulsed  studies. 

^Other  influences  on  the  measurement  of  round-trip  time  are  bond  thickness  and  diffraction. 
These  are  discussed  in  various  papers,  including  McSkimin  (1961)  and  Papadakis  (1967,  1972). 
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velocity  involved  the  determination  of  acoustic  wavelength  in  the  medium,  by 
using  the  equation 


v=p.,  (41) 

where  /  is  the  frequency  of  the  sound  wave,  determined  by  the  drive  frequency 
of  the  transducer,  and  /  is  the  wavelength.  Brillouin  scattering  also  makes  it 
possible  to  measure  the  sound  velocity  (Beyer  and  Letcher,  1969).  In  addition 
to  measurement  of  an  angle,  the  accuracy  also  depends  on  determination  of 
the  wavelength  of  light  in  the  medium;  this  requires  a  determination  of  the 
index  of  refraction  of  the  medium. 


5.  Ultrasonic  Measurement  Techniques 

Surveys  of  techniques  and  details  for  measuring  ultrasonic  velocities  are 
available  from  various  sources  (Breazeale  et  a/.,  1981;  Papadakis,  1976; 
Truell  et  ai,  1969).  Consideration  in  this  work  will  be  given  to  three  classes  of 
techniques  for  absolute  velocity  measurements.  A  compilation  of  some  of  the 
features  of  each  technique  is  given  in  Table  I. 


a.  Sample  Preparation 

The  tolerance  selected  for  preparing  the  sample  depends  largely  upon  the 
accuracy  needed  in  the  determination.  If  a  1%  measurement  is  in  order, 
surface  preparation  and  parallelism  are  less  critical  than  if  an  accuracy  of 
0.01  %  is  desired.  Also  important  is  the  consideration  of  correction  factors 
needed  to  compensate  for  bond  thickness  and  diffraction  effects.  At  present, 
diffraction  corrections  exist  only  for  compressional  waves.  Therefore,  under 
equivalent  experimental  conditions,  the  most  accurate  determination  of 
combinations  of  elastic  coefficients  would  be  those  that  are  calculated  from 
compressional  wave  velocities. 

It  is  assumed  that  a  surface  on  the  sample  has  been  ground  to  optical 
tolerances,  and  that  the  crystallographic  direction  of  the  axis  of  this  surface 
has  been  determined.  Typically,  the  crystallographic  directions  are  measured 
to  within  minutes  of  arc  (McSkimin  and  Andreatch,  1964).  The  remaining 
critical  issues  are  the  parallelism  of  the  reflecting  surface  to  the  reference 
surface,  and  the  flatness  of  these  surfaces.  Parallelism  of  the  surfaces  can  be 
measured,  for  example,  with  an  autocolliminator  or  a  He-Ne  gas  laser  and 
some  mirrors.  By  placing  the  sample’s  reference  surface  on  a  stationary  flat, 
dust-free  surface,  rotating  the  sample,  and  measuring  the  diameter  of  the 
circle  traced  by  the  beam  reflected  from  the  sample’s  top  surface,  and 
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measuring  the  total  path  length  of  the  beam  from  the  sample  to  the  image 
screen,  one  can  determine  the  parallelism  of  the  surfaces.  The  surfaces  should 
be  parallel  to  an  angle  better  than  0.01  A,„uj„^/transducer  diameter.  For  a 
typical  half-inch  diameter,  10  MHz  transducer  and  a  typical  solid,  the 
surfaces  must  be  parallel  to  better  than  4  x  10"“*  radians  (1  degree,  23  min¬ 
utes).  Higher  frequencies,  multiple  reflections,  and  continuous-wave  tech¬ 
niques  require  proportionally  smaller  tolerances. 

Typically,  one  also  tries  for  a  flatness  of  better  than  1/100  of  an  acoustic 
wavelength  for  accurate  determination  of  transit  times.  If  the  resonant 
frequency  of  the  transducer  is  10  MHz,  then  the  wavelength  in  a  solid  is  in  the 
neighborhood  of  5  x  10 meters,  which  puts  the  flatness  requirement  near 
5  micrometers.  Using  an  optical  flat  and  Newton’s  rings  analysis,  one  can 
determine  the  flatness  of  the  sample.  For  higher-frequency  transducers  and 
continuous-wave  techniques,  the  number  of  fringes  are  appropriately  de¬ 
creased.  At  100  MHz,  pulsed  mode,  for  example,  the  tolerance  is  down  to  less 
than  several  fringes.  By  using  flat  lapping  surfaces,  this  is  easily  achieved. 
When  multiple  reflections  are  involved  in  the  measurement,  one  must 
consider  that  for  each  reflection,  changes  occur  in  the  wavefront  direction; 
the  phase  change  across  the  surface  of  the  transducer  at  each  reflection  is  due 
to  lack  of  parallelism  between  the  faces.  For  a  case  where  a  20  MHz 
transducer  was  used,  and  multiple  reflections  employed  for  the  measurement, 
Papadakis  (1967)  quotes  a  sample  surface  parallelism  and  flatness  of  better 
than  10"“*  inches  per  inch. 

b.  Piezoelectric  Transducers 

Selection  of  the  piezoelectric  transducer  depends  largely  upon  the  wave 
mode,  the  electronic  equipment  to  be  used  for  the  measurement,  and  the 
personal  preference  of  the  researcher.  In  general,  piezoelectric  crystals^  such 
as  quartz  (high  electric  impedance)  and  lithium  niobate,  or  poled  ceramics 
such  as  PZT  (lead  zirconate  titanate),  are  chosen.  Their  physical  and 
electrical  properties  are  covered  elsewhere  (O’Donnell  et  ai,  1981).  Other 
piezoelectric  transducers,  including  polymeric  materials,*  are  also  available. 
The  transducer  diameter  should  be  smaller  than  the  sample  to  assure  that 
propagation  modes  are  not  affected  by  the  location  of  lateral  boundaries  (Tu 

‘Information  on  piezoelectric  transducers  is  available  from  Valpey-Fisher  Corp.,  75  South 
Street,  Hopkinton,  Massachusetts  01748,  and  Crystal  Technology  Inc.,  1060  E.  Meadow  Circle, 
Palo  Alto,  California  94303. 

’A  film  sold  under  the  tradename  KYNAR  is  an  example  of  this.  Information  on  this  material 
can  be  obtained  from  Pennwalt  Corp.,  Box  C.  King  of  Prussia,  Pennsylvania  19406-0018.  Some 
of  the  properties  are  covered  in  Bloomfield  et  al.  (1978).  Lead  attachment  to  KYNAR  films  is 
covered  in  Scott  and  Bloomfield  (1981). 
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et  ai,  1955).  This  is  especially  important  where  measurements  are  taken  on 
small  samples  that  may  have  their  lateral  sides  close  to  each  other.  One  also 
must  consider  the  problem  as  it  relates  to  wave  propagation,  since  wave 
modes  other  than  those  considered  here  can  be  excited  in  materials  of  small 
dimensions.  From  studies  on  cylindrical  specimens,  the  minimum  sample 
diameter  can  be  no  less  than  approximately  2.5a,  where  x  is  the  wavelength  of 
the  ultrasound  in  the  medium.  The  transducer  diameter  is  chosen  after  the 
frequency  is  selected.  Generally,  the  transducer  diameter  should  be  no  longer 
than  half  the  diameter  of  the  sample,  to  prevent  any  interference  in  the 
measurement  from  reflections  off  the  lateral  boundaries  caused  by  diffraction 
effects. 

Selection  of  a  bonding  material  depends  upon  the  type  of  wave  (compres- 
sional  or  shear),  and  the  temperature  range  through  which  the  measurements 
are  to  be  taken.  At  room  temperature,  a  good  choice  for  both  types  of  waves 
in  phenyl  salicylate  or  phenyl  benzoate  (Papadakis,  1964),  while  Dow- 
Corning  DC-200  silicone  is  good  for  longitudinal  waves.  At  other  tempera¬ 
tures  (McSkimin,  1957),  other  bonding  materials,  such  as  Nonaq  stopcock 
grease,  are  used  for  compressional  and  shear  waves.  Various  resins  can  be 
used  with  shear  wave  transducers;  commercially  prepared  bonding  materials 
are  also  available.^ 

Application  of  the  bond  requires  care  to  keep  surfaces  clean  and  free  of 
dust.  The  bond  should  be  as  thin  as  practical,  taking  care  not  to  break  the 
transducer.  Often,  applying  some  heat  helps  with  viscious  bonding  materials. 
When  using  phenyl  salicylate  or  phenyl  benzoate  for  bonds,  certain  proce¬ 
dures  can  be  followed  to  assure  uniform  bonds  (Papadakis,  1964a). 


c.  Noncontacting  Transducers 

There  are  three  types  of  non-contacting  excitation  transducers:  capacitive 
transducers  (compressional  wave  excitation  only)  (Cantrell  and  Breazeale, 
1977),  electromagnetic  transducers  (EM AT)  (Vasile  and  Thompson,  1977; 
Johnson  and  Mase,  1984),  and  optically  stimulated  acoustic  transducers 
(Prosser  and  Green,  1985).  With  the  exception  of  the  electrostatic  transducer 
(compressional  with  circular  piston  geometry),  diffraction  correction  data 
have  not  been  developed.  However,  bond  corrections  are  not  necessary  as 
these  methods  generate  the  wave  directly  on  the  sample  surface.  Generally, 
the  signal  levels  are  quite  small,  and  require  high  gain  and  specialized  circuits 
or  devices  to  bring  the  signals  to  usable  levels. 

^For  example,  suppliers  of  damped  ultrasonic  transducers  can  supply  material  suitable  for 
high  temperature  and  shear  measurements.  Two  companies  are  Panametrics,  Inc.,  221  Crescent 
St.,  Waltham,  Massachusetts  02254,  and  Harisonics.  Inc.,  7  Hyde  St.,  Stamford,  Connecticut 
06907. 
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6.  Corrections  to  Pulse  Measurements;  An  Example 

With  some  pulsed  systems,  and  with  care  in  preparing  the  sample  and  in 
taking  the  measurements,  one  can  expect  round-trip  time  determinations  to 
have  standard  deviations  in  the  neighborhood  of  100  picoseconds.  As  correc¬ 
tions  for  bond  thickness  and  diffraction  are  often  larger  than  this,  it  is 
necessary  to  correct  for  these  sources  of  systemic  error  where  possible.  As  an 
example,  consider,  for  a  properly  prepared  sample,  that  the  pulse-echo 
overlap  technique  is  to  be  used  to  measure  the  round-trip  time  for  a  tone 
burst.  The  time  between  the  first-received  and  the  second-received  echos  will 
be  determined.  For  simplicity,  we  will  treat  the  case  of  only  one  round  trip  in 
the  sample.  The  measured  time  T  is  composed  of  several  terms  ,  the  true  travel 
time  S,  the  bond  thickness  contribution  A®,  and  the  diffraction  contribution 
A°. 


a.  Bond  Thickness 

The  model  for  the  sample-bonded  transducer  system  has  been  developed 
by  Williams  and  Lamb  (1958)  and  has  been  used  extensively  by  McSkimin 
(1961)  and  by  Papadakis  (1967).  They  showed  that  if  the  transducer  bond 
sample  system  is  treated  as  an  acoustic  transmission  line,  it  is  possible  to 
calculate  the  effects  of  bond  thickness  on  the  measurement  of  transit  time. 
McSkimin  (1961)  showed  that  by  measuring  the  change  in  the  transit  time 
when  the  system  is  detuned  by  a  known  amount,  it  is  possible  to  determine 
the  bond  thickness  in  terms  of  measured  quantities.  This  permits  one  to 
correct  for  the  transit  time  through  the  bond  material.  Papadakis  (1967) 
discussed  the  correction  in  some  detail,  and  applied  it  to  his  measurements  on 
fused  quartz  and  silicon. 

Consider  an  undamped  transducer  bonded  to  one  end  of  a  sample  with 
some  bonding  material  of  known  velocity  and  density;  a  relatively  long  wave 
is  reflected  from  the  interface  between  the  sample  and  the  transducer  bond 
system.  The  reflected  wave  from  this  interface  experiences  a  phase  shift  from 
the  impedance  mismatch,  which  can  be  calculated  from  the  model.  The 
calculation  uses  the  real  and  imaginary  parts  of  the  effective  impedance  of  the 
components  to  obtain  the  contribution  to  the  phase  shift.  By  detuning  the 
frequency  of  the  applied  tone  burst,  we  can  measure  a  corresponding  change 
in  transit  time.  Then,  a  comparison  with  the  measured  values  p)ermits  the 
calculation  of  the  bond  thickness,  so  that  its  effect  on  the  measurement  can  be 
determined.  The  approach  is  outlined  below. 
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The  reflection  coefficient  for  the  system  described  above  is  given  by 


~  ■^s 

£i‘Zd  +  2.’ 


(42) 


where  is  the  reflected  pressure  wave,  Ej  is  the  incident  pressure  wave,  Z,  is 
the  specific  acoustic  impedance  (Elmore  and  Heald,  1969)  of  the  sample 
(Z,  =  pc,  where  p  is  the  mass  density,  and  c  is  the  wave  propagation  speed), 
and  Zd  is  the  effective  specific  acoustic  impedance  of  the  transducer  bond 
system,  given  by  McSkimin  (1961), 


■^d  —j^i 


|tan  /c,A,  +  tan  /cjAj 


—  tan  /cjAi 


(43) 


where  and  Zj  are  the  specific  acoustic  impedances  of  the  bond  material 
and  transducer  material,  respectively.  Using  similar  conventions,  /c,  and 
are  propagation  constants  in  the  respective  materials,  as  Aj  and  A 2  are  the 
respective  bond  and  transducer  thicknesses,  and  j  is 
The  phase  angle  (McSkimin,  1961)  is  calculated  by  writing  Eq.  (42)  in 
complex  polar  form  and  gives 


y  =  arctani 


2|Zd|Z,  \ 

IZdl^-ZsV’ 


(44) 


where  \Z^\  is  the  modulus  of  Zj,  and  (n  +  y)  is  the  phase  shift  between  the 
reflected  and  incident  pressure  waves  impinging  on  the  interface  of  the 
transducer  bond  system  with  the  sample. 

When  the  system  is  driven  at  the  transducer  resonant  frequency,/,  can  be 
calculated  by  using  the  fact  that  the  transducer  at  fundamental  resonance  has 
a  thickness  of  one-half  of  a  wavelength;  then  IZ^I  becomes  Z,  tan/ciAj, 
which  is  generally  quite  small  for  a  thin  bond.  Frequencies  of  off-transducer 
resonance,  generally  chosen  as  0.9 /„  give  a  larger  phase  angle,  since  the 
dependence  of  Z^  upon  off-resonance  frequency  excitation  is  large.  As 
outlined  in  McSkimin  (1961)  and  Papadakis  (1967),  we  can  calculate  the  time 
difference  between  phase-matched  echo  trains,  AT,  caused  by  the  change  of 
drive  frequency,  and  compare  it  with  experimental  results  to  determine  the 
correct  condition  for  overlap. 
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Assume  an  alignment  of,  for  example,  the  first  and  second  echoes  in  the 
sample.  The  change  in  the  measured  time  between  echos  is  given  by 


AT  = 


(45) 


where  p  is  the  number  of  round  trips  (for  our  case,  p  =  1)  of  the  ultrasonic 
tone  burst  in  the  sample,  n  is  the  number  of  cycles  of  mismatch  in  the  tone 
burst,  fi  is  the  off-transducer  resonance  drive  frequency,  /r  is  the  drive 
frequency  at  transducer  resonance,  Vl  is  the  phase  angle  at  the  off-resonance 
frequency  (radians),  and  Vr  is  the  phase  angle  (radians)  at  the  transducer 
resonance.  One  can  calculate  AT  for  the  case  of  no  cycle  mismatch  (n  =  0) 
and  no  bond  thickness  (y*  =  0)  from  Eqs.  (43)  through  (45).  The  measured 
value  closest  to  the  calculated  value  determines  the  correct  experimental  cycle 
for  cycle  match.  Following  the  experimental  identification  of  the  correct 
match,  the  measured  AT  can  be  used  to  determine  the  correction  for  bond 
thickness  by  adjusting  A,  in  Eq.  (43)  and  solving  Eq.  (44)  to  bring  Eq.  (45) 
into  agreement  with  the  measured  AT  values.  The  phase  angle  at  resonance 
can  be  determined,  and  the  travel  time  correction  for  bond  thickness.  A®,  is 
given  by 


A®  = 


(46) 


where  Vr  and  /r  are  the  phase  angle,  and  drive  frequency  respectively,  both 
taken  at  transducer  resonance. 


b.  Diffraction  Corrections 

Diffraction  effects  for  compressional  waves  in  various  crystalline  symme¬ 
tries  have  been  treated  by  several  investigators  (Seki  et  ai,  1956;  Papadakis, 
1963, 1964b,  1966;  Williams,  1970),  while  others  have  treated  isotropic  media 
(Benson  and  Kiyohara,  1974;  Khimunin,  1972;  Rogers  and  Van  Buren, 
1974).  Papadakis  (1972)  has  shown  that  without  diffraction  corrections,  one 
can  expect  errors  in  travel  times  as  large  as  0.25//r.  He  also  discussed  (1972) 
diffraction  corrections  for  the  technique  of  pulse-echo  overlap  technique. 
Using  the  dimensionless  quantity 


S 


ZA 


a- 


(47) 
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where  S  is  the  Seki  parameter,  a  is  the  transducer  radius,  /  is  the  wavelength 
in  the  sample,  and  z  is  the  distance  of  propagation,  he  wrote  the  phase 
correction  due  to  diffraction,  as 


(48) 

2nf 

where  (j)  is  the  phase  shift  in  radians  of  the  received  wave  front  that  is  due  to 
diffraction  effects.  For  the  example,  the  sample  is  of  length  L,  which  gives 
S„  =  2L/.ia~,  and  =  AL/Ja}. 

The  true  travel  time,  b,  can  be  written  in  terms  of  the  measured  time,  T,  and 
the  corrections; 

<5  =  T  +  A®  +  (49) 


The  thickness  of  the  sample,  L,  is  measured  by  conventional  means,  such  as  a 
high-precision  micrometer,  and  the  velocity  of  sound  is  given  by 


2L 


(50) 


7.  Optical  Techniques 

Optical  techniques  have  been  used  with  success  in  the  measurement  of 
sound  velocities  in  transparent  media.  The  methods  include  diffraction  and 
scattering  of  light  by  sound  waves.  A  direct  method  of  optically  measuring  an 
acoustic  wavelength  in  the  material  can  also  be  used. 

The  diffraction  techniques  considered  here  permit  the  determination  of  the 
wavelength  of  sound.  The  wavelength.  A*,  is  combined  with  the  ultrasonic 
frequency,  /,  of  the  sound  beam  to  calculate  its  velocity,  according  to  the 
expression 

P*  =  v,  (51) 

where  v  is  the  speed  of  sound  in  the  sample.  In  any  optical  technique, 
however,  one  must  specify  the  type  of  diffraction  experienced  by  the  light. 


a.  Fraunhofer  Diffraction  of  Light  by  Sound 

Consider  an  ultrasonic  transducer,  bonded  to  a  transparent  solid,  generat¬ 
ing  ultrasonic  compressional  waves  that  propagate  into  the  solid.  Further 
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suppose  that  the  sound  wave  encounters  light  traversing  the  same  medium  so 
that  the  light  is  diffracted.  There  are  two  different  physical  regimes  that  can 
produce  the  diffraction  effects.  One  involves  the  formation  of  a  corrugation  in 
the  phase  fronts  of  the  light  that  is  due  to  the  spatial  vanation  in  the  index  of 
refraction  of  the  solid,  as  caused  by  the  sound  waves.  This  is  called 
Raman-Nath  diffraction  (Raman  and  Nath.  1935a,  1935b,  1936a,  1936b, 
1936c;  Born  and  Wolf,  1970).  The  second  regime  involves  the  reflection  of 
light  from  the  evenly  spaced  crests  of  the  sound  waves.  These  reflections  occur 
under  some  conditions  that  are  similar  to  x-ray  diffraction  by  a  crystal  lattice; 
this  IS  called  Bragg  diffraction  (Bhatia  and  Noble,  1953).  Both  types  of 
diffraction  can  be  used  to  determine  the  sound  velocity  of  the  material. 

In  order  to  determine  '"hich  type  of  diffraction  effect  predominates 
(Nomoto,  1942),  a  dimensionkss  parameter  (Klein  et  al.,  1965),  Q,  is  defined 
as 


Q  = 


(52) 


where  K*  is  the  ultrasonic  propagation  constant,  L  is  the  width  of  the 
ultrasonic  beam,  Hq  is  the  index  of  refraction,  and  K  is  the  propagation 
constant  of  light  in  vacuum.  If  0  >  9,  one  has  Bragg  d’ffraction.  U  Q  <  I, 
Raman-Nath  diffraction  occurs.  For  1  <  Q  <  9,  the  diffra'’tion  is  mixed.  For 
illustrative  purposes,  consider  a  typical  transparent  solid  vC,ou„d  ~  5  x  10^ 
m/s,  index  of  refraction  ss  1.5,  and  ultrasonic  beam  width  s:  1.27  x  10"^  m), 
illuminated  with  light  at  wavelength  632.8  nm.  If  the  ultrasonic  frequency  is 
approximately  27  MHz  or  less,  the  interaction  satisfied  Raman-Nath  diffrac¬ 
tion  conditions.  If  the  frequency  is  greater  than  approximately  81  "MHz,  the 
interaction  is  governed  by  Bragg  diffraction  conditions. 


b.  Raman-Nath  Diffraction 

For  the  case  of  Raman-Nath  diffraction,  we  consider  the  light 
beam  impinging  on  the  medium  at  an  angle  4>,  with  the  light  normal  to  the 
sound  beam.  The  location  of  the  diffraction  orders  are  given  by  the 
expression  (Breazeale  et  al.,  1981) 


sin(0„ -f  <^)  -  sin(</>)  =  — ,  (53) 

where  (f)  is  the  angle  of  incidence,  6  is  the  angle  of  diffraction,  /  is  the 
wavelength  of  light  in  the  medium,  2*  is  the  ultrasonic  wavelength,  and  n  is  an 
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integer.  One  can  experimentally  set  </>  =  0,  which  reduces  Eq.  (53)  to 

sin(0„)  =  p.  (54) 

The  ultrasonic  wavelength  can  be  determined  from  Eq.  (53).  With  the 
measurement  of  the  ultrasonic  frequency,  one  can  determine  the  sound 
velocity  of  the  compressional  wave  (Barnes  and  Hiedemann,  1957).  In 
determining  the  wavelength  measurement  and  its  uncertainty,  it  is  necessary 
to  make  an  analysis  of  the  optical  setup,  including  the  effects  of  Snell’s  Law  at 
interfaces. 

c.  Bragg  Diffraction 

For  the  case  of  Bragg  diffraction,  the  angle  of  incidence,  tf),  is  set  to  the 
angle  of  diffraction,  and 

nX  =  2X*sm<t>ji,  (55) 

where  <f)^  is  the  Bragg  angle.  Bragg  diffraction  is  used  to  measure  wave 
velocities  in  the  frequency  range  from  approximately  100  MHz  to  the  low  end 
of  the  gigahertz  scale.  For  example  (Krischer,  1968),  the  technique  has  been 
used  to  measure  wave  velocity  to  an  estimated  accuracy  of  better  than  0.1%. 
It  is  also  useful  in  measuring  the  local  velocities  within  a  sample  (Simondet 
et  a/.,  1976;  Michard  and  Perrin,  1978).  Measurements  with  precision  of 
better  than  0.01%  in  homogenous  samples  have  been  reported  (Simondet 
et  al.,  1976). 

d.  Direct  Measurements  ( Fresnel  Diffraction ) 

Consider  a  standing  ultrasonic  wave  in  a  sample  through  which  collimated 
light  is  passed  so  that  the  collimated  light  beam  is  perpendicular  to  the  sound 
beam.  A  measuring  microscope  or  similar  optical  device  is  focused  so  that  the 
image  of  the  wave  can  be  viewed  at  the  instrument  focal  plane.  Because  the 
images  of  the  wave  fronts  are  kjl  apart,  it  is  possible  to  determine  the 
ultrasonic  velocity  from  measurements  of  ultrasonic  frequency  and  measure¬ 
ments  of  the  wavelength  in  the  medium.  The  technique  is  sensitive  enough  to 
detect  local  variations  in  velocity  greater  than  0.01%  (Hiedemann  and 
Hoesch,  1934,  1937;  Mayer  and  Hiedemann,  1958,  1959). 

e.  Brillouin  Scattering 

Consider  the  scattering  of  a  photon  by  a  high-frequency  phonon  traveling 
in  a  specific  direction  within  a  crystal.  Application  of  conservation  of  energy 
and  momentum  to  the  scattering,  coupled  with  the  approximation  that  any 
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photon  frequency  shift  is  small  (Benedek  and  Fritsch,  1966;  Beyer  and 
Letcher,  1969,  pp.  47-50),  gives 


where  v  is  the  photon  frequency,  v*  is  the  phonon  frequency  as  well  as  the 
difference  in  frequency  of  the  scattered  photons,  C  is  the  speed  of  the  photon 
in  the  medium,  v  is  the  speed  of  the  phonon,  and  6  is  the  scattering  angle.  If 
desirable,  the  Bragg  condition  can  be  used  for  constructive  reinforcement  by 
adjusting  6  (Pollard,  1977).  The  technique  gives  three  lines  in  the  scattered 
photon  spectra,  which  can  be  separated  and  measured  with  appropriate 
optical  devices,  such  as  Fabry-Perot  interferometers,  to  obtain  v*.  Equa¬ 
tion  (56)  can  be  solved  for  v  in  terms  of  the  other  quantities.  Brillouin 
scattering  is  useful  in  the  investigation  of  the  sound  velocity  of  a  material  near 
a  phase  transition  (Fleury,  1970,  pp.  37-42).  The  technique  has  been  used  for 
both  longitudinal  and  mixed  modes.  However,  in  one  study  on  cubic  crystals, 
no  Brillouin  scattering  was  observed  from  an  acoustic  branch  that  consisted 
of  pure  transverse  waves  (Benedek  and  Fritsch,  1966).  Uncertainties  in  the 
index  of  refraction,  the  scattering  angle,  and  the  width  of  the  Stokes  and  anti- 
Stokes  lines  influence  the  accuracy  of  the  determination.  The  precision  is 
considerably  better,  however,  with  values  of  0.1%  as  mentioned. 

In  stimulated  Brillouin  scattering,  the  photon  scattering  process  is  depen¬ 
dent  upon  the  intensity  of  the  radiation  striking  the  surface.  With  high 
enough  light  intensity,  nonlinear  effects  occur,  which  result  in  scattering  by 
frequencies  and  harmonics  created  by  harmonic  generation  (Brewer,  1965).  A 
large  buildup  of  acoustic  intensity,  both  compressional  and  shear  waves, 
accompanies  a  threshold  in  optical  intensity  (Chiao  et  ai,  1964).  Other  effects 
include  the  possibility  of  sample  destruction  Irom  the  intense  radiation,  and 
the  line  pulling  effects  of  the  laser  cavity  on  the  scattered  light,  which  affects 
accuracy  (Fleury,  1970,  pp.  57-58).  Amplified  acoustic  frequencies  have  been 
reported  as  high  as  60  GHz. 


III.  Theoretical  and  Experimental  Results 

8.  Ab  Initio  Theory 

It  is  clear  from  Section  2  that  the  calculation  of  elastic  constants  requires 
an  accurate  computation  of  the  variation  in  the  total  energy  of  a  solid,  from 
equilibrium  to  distorted  configurations.  Thanks  to  the  availability  of  power¬ 
ful  computers,  great  advances  in  ab  initio  total-energy  calculation  have  been 
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made  in  recent  years  (Anderson  et  al.,  1985).  In  this  section  we  briefly  discuss 
the  approximations  used  in  ab  initio  theory  and  summarize  the  calculated 
results. 

The  Hamiltonian  of  a  solid  consists  of  five  parts;  the  two  kinetic  energies  of 
the  ions  and  electrons,  K,  and  respectively,  and  the  potential  energies  l/|, 
among  ions,  among  electrons,  andL^ei  between  electrons  and  ions.  The 
Born-Oppenheimer  (1927)  adiabatic  approximation  is  a  simple  way  to 
separate  the  electronic  from  the  ionic  variables.  In  this  approximation, 
because  the  ion  speed  is  at  least  two  orders  smaller  than  the  electron  speed, 
one  freezes  the  ionic  motion  in  a  configuration  specified  by  a  set  of  ion 
position  vectors  R„,  then  solves  the  Schrodinger  equation  for  that  part  of  the 
Hamiltonian  that  involves  the  set  of  electronic  coordinates  {rj: 


{R„])  =  E,i{R„})Mf,},  {R„]),  (57) 

where  H  =  K^-\-  (/„  +  L/,,.  Thus  the  energy  is  a  function  of  the  ionic 
configuration^  The  lowest  energy  curve  of  the  sum  of  E^  =  Ey+  C/„  as  a 
function  of  {R„}  then  serves  as  the  potential  energy  for  the  ionic  motion.  A 
Taylor  expansion  of  £g  about  its  minimum  value  takes  the  form 

£,{£„}  =  £„  +  i  +  •  •  •  ^  (58) 

where  u„,  is  the  a-component  of  the  small  displacement  R„  —  R„{0)  with 
{£„(0)}  the  equilibrium  ionic  positions  at  the  minimum  energy  Eq.  The  force 
functions  are  the  second  derivatives  of  £,  with  respect  to  the  displace¬ 
ments  evaluated  at  the  ion  equilibrium  positions.  The  force  functions  are 
directly  related  to  the  elastic  constants.  For  example,  for  a  Bravais  lattice  they 
are  given  by  Aschcroft  and  Mermin  (1976)  as 

(59) 

where  the  sum  is  over  all  the  lattice  vectors  L. 

Equation  (57)  is  computationally  the  most  difficult  part  of  the  problem, 
because  it  deals  with  about  10^^  electrons  that  are  interacting  with  each  other 
and  with  the  ions;  the  wave  functions  have  to  be  the  properly  antisym¬ 
metrized  many-body  functions.  Self-consistent  density-functional  theory 
(SCDFT)  (Hohenberg  and  Kohn,  1964;  Kohn  and  Sham,  1965;  Callaway 
and  March,  1984),  which  casts  the  Hamiltonian  into  a  density  functional, 
reduces  the  many-body  problem  to  an  effective  one-electron  problem.  This 
theory  has  been  tested  in  many  crystalline  solids,  and  the  resulting  elastic 
constants  have  been  excellent,  especially  for  semiconductors. 
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In  SCDFT,  the  ground-state  energy  of  a  solid  is  completely  specified  by 
single-particle  wave  functions  of  the  occupied  states  {<^v}.  First,  the  electron 
density  is  given  by 

pif)  =  IJ(^,(f)|2. 

(60) 

Then,  the  ground  state  energy  is  computed  as  follows: 

£*  =  +  t/cc  +  t;.,  +  -h  L/,,, 

(61) 

where  the  different  terms  are  given  by  the  expressions 

/C,  =  1 0J(r)  ~  <i),if)  d^r, 

(62) 

(63) 

J  K—  -''nl 

(64) 

v„  =  -  R.I. 

(65) 

L/,,  =  p(f)E^clPi^¥^r . 

• 

(66) 

Note  that  n  =  m  is  excluded  in  the  sum  in  Eq.  (65).  The  meaning  and  the 
notations  of  the  above  equations  are  mostly  self-evident,  except  the  £,cLp] 
Eq.  (66);  this  is  the  correction  term  arising  from  the  many-body  exchange  and 
correlation  effects.  The  square  bracket  means  that  is  a  functional  of  p. 
Several  different  expressions  for  as  a  function  of  p,  available  in  the 
literature  (for  example,  Wigner,  1934;  Hedin  and  Lunquist,  1971;  Ceperley 
and  Alder,  1980;  Perdew  and  Zunger,  1981),  have  yielded  similar  results  for 
the  structural  properties  of  semiconductors.  A  minimization  of  £,  with 
respect  to  (f>*,  with  the  constraint  that  the  total  number  of  electrons  is  a 
constant,  leads  to  the  familiar  Schroedinger  single-particle  equation: 

Tl;  +  =  £.*.(?),  (67) 
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where  is  a  Lagrange  multiplier  and  V  is  an  effective  one-electron  potential 
containing  three  parts: 


=  K.  +  Ki  + 

(68) 

(69) 

Vfi  is  the  Coulomb  potential  due  to  ionic  charges  Z„ : 

K,  =  -e^l„ZJ\f-RJ.  (70) 

Finally,  the  exchange-correlation  potential  is  given  by 

=  +  (71) 

dp 

Thus,  Eqs.  (60)  and  (67)-(71)  form  a  repeated  loop,  p  V  <t>  p,  and 
the  calculation  must  be  iterated  until  self-consistency  is  achieved.  Following 
this  recipe  for  calculating  the  total  energy,  one  calculates  £,  as  a  function  of 
ionic  positions,  finds  the  equilibrium  configuration,  then  imposes  a  strain  and 
calculates  the  strain  energy  to  deduce  the  elastic  constants,  following  the 
prescription  of  Section  2.  The  problem  then  becomes  strictly  computational. 
The  most  challenging  task  is  an  accurate  solution  for  the  single-particle  eigen 
states  of  Eq.  (67).  For  a  crystalline  solid,  lattice  translational  symmetry 
simplifies  the  problem,  and  band-structure  techniques  can  be  applied  to 
obtain  the  solution.  Because  the  strain  energy  is  many  orders  of  magnitude 
smaller  than  the  total  energy,  very  precise  computation  is  required  if  one 
hopes  to  obtain  meaningful  elastic  constants.  So  far,  at  least  two  band- 
structure  metliods  have  been  demonstrated  as  reliable  for  all  three  elastic 
constants:  the  plane- wave  method  using  pseudopotentials  (PP-PW)  (Nielsen 
and  Martin,  1983,  1985a),  and  the  full-potential  linearized-muffin-tin-orbital 
method  (FP-LMTO)  (Methfessel  etai,  1989).  Although  the  full-potential 
argumented  plane-wave  method  (FP-APW)  (Krahauer  et  ai,  1979;  Wimmer 
etai,  1981;  Wei  and  Krahauer,  1985;  Ferreira  etfl/.1989)  has  produced 
excellent  lattice  constants,  structural  energies,  and  bulk  moduli,  the  complete 
semiconductor  elastic  constants  based  on  this  method  are  not  yet  available. 

Even  if  the  total  energies  at  different  distortions  can  be  calculated 
accurately,  there  is  still  the  problem  of  searching  for  the  equilibrium  atomic 
positions  in  a  distorted  crystal,  and  the  numerical  determination  of  elastic 
constants  from  energy  differences.  If  the  strain  energy  can  be  calculated 
directly  without  taking  the  difference  between  two  large  energies,  or  if  the 
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derivatives  can  be  calculated  directly,  not  only  can  the  computation  time  be 
shortened,  but  the  numerical  errors  will  also  be  reduced.  The  quantum 
mechanical  theory  of  forces  and  stresses  of  Nielsen  and  Martin  (1985b)  and 
the  closely  related  direct  calculation  of  elastic  constants  from  linear  response 
theory  of  Baroni  et  al.  (1987)  represent  the  status  of  efforts  in  this  direction. 
The  former  has  been  carried  out  for  all  three  elastic  constants  for  Si,  Ge  and 
GaAs  (Nielsen  and  Martin,  1985a),  while  the  latter  has  been  done  only  on  the 
bulk  modulus  for  Si;  both  are  based  on  the  PW-PP  method.  Table  II  shows  a 


TABLE  II 


Comparison  between  calculated  and  experimental  constants* 


Expt" 

FP-LMTO' 

PP-PW^ 

TB 

SI 

a 

5.431 

5.41 

5.45 

5.431 

B 

9.923 

9.9 

9.3 

9.923 

Cu  - 

^  1  2 

10.274 

10.2 

9.8 

10.274 

C44 

8.036 

8.3 

8.5 

8.013 

11.1 

11.30 

c 

0.54' 

0.51 

0.53 

0.51 

w 

523 

518 

521 

572 

Ge 

a 

5.65 

5.59 

5.65 

B 

7.653 

7.2 

7.653 

1 

Cu 

8.189 

8.5 

8.189 

6.816 

6.3 

6.84 

0°’ 

7.7 

9.46 

c 

0.44 

0.49 

(jj 

303 

302 

342 

GaAs 

a 

5.642 

5.55 

5.642 

B 

7.69 

7.3 

7.69 

^11  * 

^12 

6.63 

7.0 

6.63 

6.04 

6.2 

5.79 

7.5 

7.83 

c 

0.48 

0.50 

0} 

273 

268 

292 

‘’Comparison  between  calculated  and  experimental  lattice  constant  a,  elastic  constants  B, 
C,,  —  C,2  and  C44,  Kleiman  (1962)  internal  distortion  parameter  C,  and  the  TO  optical 
phonon  w  in  wave  numbers  1/cm.  Also  listed  arc  defined  in  Eq.  (13).  The  FP-LMTO  and 
PP-PW  are  the  ab  initio  theories  described  in  Part  1,  and  TB  is  the  tight-bonding  theory  in 
Part  III,  Section  10.  All  clastic  constants  are  in  units  of  10“  dynes/cm^. 

'’All  the  experimental  lattice  constants  arc  those  tabulated  by  Zallen  (1982).  The  experimen¬ 
tal  elastic  constants  are  taken  from  Table  III.  and  the  phonon  frequencies  are  from  Table  V. 

'Methfessel  et  al.  (1989). 

^Nielsen  and  Martin  (1985a). 

'Cousins  et  al.  (1987). 
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comparison  between  theoretical  calculations  and  experimental  results.  From 
this  comparison,  it  is  fair  to  say  that  we  have  a  very  reliable  ab  initio  theory 
for  the  elastic  constants  for  crystalline  semiconductors  based  on  the  self- 
consistent  local  density-functional  theory.  Note  that  Table  II  also  lists  the 
results  from  an  empirical  tight-binding  (TB)  theory  to  be  discussed  in 
Section  10. 


9.  Valence  Force  Field  Model 

The  preceding  section  showed  that  ab  initio  theory  for  the  elastic  constants 
requires  complicated  computations.  Accurate  ab  initio  calculations  for  semi¬ 
conductors  have  been  obtained  only  recently  and  only  for  several  systems. 
On  the  other  hand,  phenomenological  microscopic  models  of  elastic  con¬ 
stants  for  all  semiconductors  have  been  available  for  some  time.  Of  these,  the 
valence  force-field  model  (VFF)  is  perhaps  the  simplest  and  the  most  useful. 
This  topic  has  been  reviewed  and  well  analyzed  in  a  paper  by  Martin  (1970), 
and  its  conclusions  constitute  the  main  body  of  this  section. 


a.  Diamond  Structure 

The  original  VFF  model  by  Musgrave  and  Pople  (1962)  was  for  the 
diamond  structure,  in  which  the  elastic  energy  is  a  quadratic  form,  in  terms  of 
the  changes  in  each  bond  length  Ar^,  in  bond  angles  A0,j,  and  in  the  products 
Ar^Arj  and  A0,y,  between  nearest-neighbor  bonds.  For  elastic  constants, 
Keating  (1966)  showed  that  the  VFF  can  be  simplified  by  the  following 
approximation  to  the  elastic  energy  of  the  crystal: 

^  I  [A(ri  •  ^  I  [A(f,  •  f,)]2 .  (72) 

where  the  bond  index  i  runs  over  all  the  bonds,  but  the  i  and  j  sum  only  over 
those  pairs  of  bonds  that  are  connected  to  a  common  atom.  In  Eq.  (72),  d  is 
the  equilibrium  bond  length  and  A(rj  •  fj)  is  the  change  in  the  dot  product  of 
the  two  bond  vectors  that  start  at  the  common  atom,  point  along  the  bord 
directions,  and  end  at  the  first  neighbor  atoms.  Following  the  calculational 
procedure  described  in  Section  2  for  uniform  expansion,  the  U  in  the  VFF 
under  a  uniform  expansion  can  be  shown  tobe  U  =  2(3a  -f  where 

Q  =  a^/A  is  the  equilibrium  volume  per  unit  cell.  Thus, 

B  =  (Cn  +  2C,,)f3  =  (a  -f-  )?/3)/a .  (73) 
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For  the  shear  strain  described  by  Eq.  (9),  Eq.  (72)  yields  U  =  A^e^/a.  Thus, 
according  to  Eq.  (10), 


^11  (74) 

For  the  shear  strain  described  by  Eq.  (11)  and  with  an  internal  displacement 
u  =  (0, 0,  u),  Eq.  (72)  yields  the  following  expression; 

U  =  lci(e  -  ti)^  +  P(e  +  rjfyi^a) ,  (75) 

where  u  =  rja/4.  A  comparison  between  Eqs.  (13)  and  (75)  shows  that 
<t>  =  16(a  +  /?)/n^,  D  =  —  4(a  —  ^)/fl^,  and  C‘4^  =  (a  +  ^)/fl.  Using  these  re¬ 
sults  in  Eq.  (14),  we  find  that  the  Kleiman  internal  displacement  parameter  in 
the  present  model  is  given  by 

C  =  (a  -  moi  +  i?)  =  2Ci2/(Cn  +  C,^).  (76) 

Equation  (15)  then  produces 

C44  =  2a^/[(a  +  m  •  (77) 

The  three  elastic  constants  given  above  are  not  independent  and  can  be 
shown  to  related  to  each  other  by  the  Keating  identity  (1966),  or 

/k  =  2C,»(C„  +  C.2)/[(C,.  -  C„XC..  +  3C„)]  =  1 .  (78) 


b.  Zincblende  Structure  and  Coulomb  Force 

The  Keating  identity  (1966)  holds  very  well  for  systems  with  the  diamond 
structure  but  not  so  well  for  the  zincblende  compounds  (see  Table  III).  One 
obvious  difference  between  the  two  structures  is  the  presence  of  Coulomb 
interactions  arising  from  charge  shifts  between  the  cation  and  anion  sublat¬ 
tices  in  zb  semiconductors.  Martin  (1970)  incorporated  Blackman’s  (1959) 
treatment  of  the  Coulomb  forces  in  the  Keating  VFF  in  the  following 
manner.  First,  the  Coulomb  energy  was  treated  as  a  screened  Madelung 
energy  For  example,  in  a  uniformly  expanded  crystal  with  a  bond  length 
r,  the  Coulomb  energy  was  taken  to  be  =  —  Na.t^*^e^/{Er),  where  N  is  the 
total  number  of  unit  cells,  aw  =  1.6381  is  the  Madelung  constant,  and  Z*^/£  is 
the  effective  charge  defined  by  the  optic-mode  splitting; 


S  =  Z*V£  =  ~  ^‘^?)/(4^e^) .  (79) 
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TABLE  III 


Experimental  values  for  cubic  semiconductors* 


Cu 

^12 

c** 

5 

a 

U 

iu 

^MOt* 

c 

107.640 

12.520 

57.740 

0.0 

129.100 

84.573 

1.00 

1.00 

1.02 

Si‘ 

16.772 

6.498 

8.036 

0.0 

49.247 

13.951 

1.00 

1.00 

1.13 

Ge*- 

13.112 

4.923 

6.816 

0.0 

39.438 

11.583 

1.08 

1.08 

1.05 

AlSb" 

8.769 

4.341 

4.076 

1.684 

33.768 

6.653 

1.11 

1.05 

1.08 

Gap 

14.390 

6.520 

7.143 

3.815 

46.965 

10.448 

1.12 

1.05 

1.08 

GaAs* 

12.110 

5.480 

6.040 

2.827 

40.895 

9.159 

1.12 

1.06 

1.05 

GaSb" 

9.089 

4.143 

4.440 

1.569 

33.123 

7.412 

1.10 

1.06 

1.06 

InP" 

10.220 

5.760 

4.600 

3.766 

41.095 

6.250 

1.20 

1.07 

1.03 

InAs" 

8.329 

4.526 

3.959 

2.820 

33.744 

5.531 

1.22 

1.11 

1.00 

InSb* 

6.918 

3.788 

3.132 

1.372 

29.909 

4.951 

1.17 

1.11 

1.05 

ZnS' 

9.420 

5.680 

4.360 

6.788 

37.026 

4.571 

1.33 

1.07 

0.95 

ZnS' 

10.790 

7.220 

4.120 

6.788 

45.126 

4.341 

1.28 

1.02 

1.01 

ZnS' 

9.810 

6.270 

4.483 

6.788 

39.947 

4.300 

1.42 

1.13 

0.90 

ZnS' 

10.460 

6.530 

4.630 

6.788 

41.880 

4.828 

1.33 

1.08 

0.95 

ZnSe*’ 

8.95 

5.39 

3.984 

4.368 

34.432 

4.716 

1.28 

1.09 

0.98 

ZnSc' 

8.59 

5.06 

4.06 

4.368 

34.519 

4.673 

1.32 

1.13 

0.95 

ZnSe' 

8.720 

5.240 

3.920 

4.368 

35.469 

4.603 

1.29 

1.10 

0.98 

ZnTe' 

7.130 

4.070 

3.120 

2.566 

29.976 

4.452 

1.18 

1.06 

1.05 

ZnTe' 

7.220 

4.090 

3.080 

2.566 

30.204 

4.558 

1.14 

1.03 

1.08 

CdTe‘ 

5.33 

3.65 

2.04 

3.105 

27.058 

2.455 

1.34 

1.07 

0.98 

CdTe' 

6.150 

4.300 

1.960 

3.105 

31.546 

2.731 

1.16 

0.94 

1.13 

HgTe* 

5.971 

4.154 

2.259 

2.381 

30.300 

2.542 

1.37 

1.16 

0.96 

HgTe' 

5.63 

3.66 

2.11 

2.381 

26.919 

2.542 

1.37 

1.15 

0.95 

"Experimental  elastic  constants  for  some  cubic  semiconductors  and  the  parameters  of  Eq. 
(81)  taken  from  Martin  (1970),  with  the  force  constants  a  and  P  obtained  from  Eqs.  (82)  and 
(83)  and  the  identity  relations  l^,  and  /bqm  given  by  Eqs.  (78),  (88),  and  (119),  respec¬ 
tively.  The  elastic  constants  are  in  units  of  10*’  dynes/cm^  and  the  force  constants  are  in 
10^  dynes/cm. 

^Data  quoted  from  ‘'Landolt-Bomstein  Numerical  Data  and  Functional  Relationships  in 
Science  and  Technology,”  New  Series,  Vols.  17  and  22. 

'Listed  in  the  review  by  Mitra  and  Massa  (1982). 


In  Eq.  (79),  coi  and  co,  are,  respectively,  the  longitudinal  and  transverse 
phonon  frequencies  in  the  long-wave-length  limit.  Then,  to  counterbalance 
the  Coulomb  forces,  a  repulsive  force  term  was  added  and  assumed  to 
contribute  to  the  bond-stretching  energy  in  the  form 

AEr  =  —  Z  ArJiAEd^) .  (80) 


With  the  above  two  contributions  added,  the  total  strain  energy  is  AEj  = 
AE  +  A£m  +  A£r,  where  A£  is  the  VFF  contribution  in  Eq.  (72)  and  A£m  is 
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the  change  in  the  Madelung  energy.  The  energies  A£j  are  expanded  in  a 
power  series,  and  only  terms  up  to  the  second  power  in  the  strain  are  kept. 
The  A£m  contributions  arising  from  fixed  values  of  the  charge  shift  S  =  2*^/£ 
on  the  atomic  sites  under  different  strains  were  worked  out  by  Blackman 
(1959).  Using  these  results  and  defining 

s  =  ,  (81) 

Martin  (1970)  obtained  the  following  modified  expressions  for  the  elastic 
constants; 


Cii +2Ci2  =  (3a +  ^)/a-  0.355s,  (82) 

Cii  -  2Ci2  =  +  0.053s,  (83) 

C  =  [(a  -  -  0.294s]/Cm  ,  (84) 

=  (a  +  -  0.136s  -  CmC'  ,  f85) 

where  is  defined  as 

CM  =  (a  +  ^)/a-0.266s.  (86) 

The  above  equations  can  be  comb  : /.d  to  yield 

C  =  (2Ci2  -  C')/(C„  +  Ci2  -  C'),  (87) 


where  C  =  0.314s.  Since  the  extra  parameter  s  is  fixed  by  the  optic  modes  and 
the  bond  length,  the  above  results  combine  into  a  new  identity,  the  Martin 
identity  (1970): 


2C44,(Cii  +  Ci2  —  C) 


(Cn  -  Ci2)(Cii  +  3Ci2  -  2C')  +  0.831C'(Cii  +  -  C') 


.(88) 


Table  III  lists  a  set  of  experimental  values  of  the  elastic  constants  and  the  s 
values  for  a  number  of  diamond  and  zincblende  semiconductors.  These 
values  are  used  to  compute  the  force  constants  a  and  ^  and  the  identity 
expressions  and  given  in  Eqs.  (78)  and  (88),  respectively.  Several  sets  of 
data  are  quoted  for  some  of  the  systems  to  show  the  uncertainties  in  the 
experiments  for  these  systems.  The  table  results  clearly  show  that  the 
inclusion  of  the  Coulomb  energies  improves  the  identity  relation;  the 
deviations  of  1^  from  unity  are  15%  or  less.  Also  listed  are  the  values  for 
another  identity  relation,  /goM  Eq.  (119),  based  on  a  tight-binding 
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model  to  be  discussed  in  Section  10.  Martin  further  studied  trends  as 
functions  of  the  bond  lengths  d  and  the  ionicity  scale /;  of  Phillips  (1973)  and 
Van  Vechten  (1969).  He  found  that  a  scales  roughly  as  \/d^,  i.e., 

ad^le^  =  constant ,  (89) 

where  e  is  the  electron  charge.  He  also  found  the  ratio  between  the 
bond-angle  and  bond-stretching  forces  tend  to  decrease  as  /;  increases  and 
scales  roughly  as 


/?/aocl-/i.  (90) 

He  further  observed  that  if  S  of  Eq.  (79)  is  set  equal  to  and  if  the  a  and  ^ 
values  are  extrapolated,  using  Eqs.  (89)  and  (90),  from  those  fitted  to  the 
average  values  of  the  B  and  2  for  Si  and  Ge,  then  all  the  elastic 

constants  can  be  predicted  from  Eqs.  (82)  through  (86)  to  an  accuracy  of  10%. 

It  is  interesting  to  compare  Eq.  (82)  using  the  results  of  Eqs.  (89)  and  (90), 
with  Cohen’s  (1985)  empirical  formula  for  the  bulk  modulus 

P  =  (1971  -  22A)/(i^  %  (91) 

where  B  is  in  GPa,  d  in  A,  and  A  =  0, 1,  and  2,  respectively,  for  the  group 
IV,  II-V,  and  II-VI  semiconductors.  Both  Martin  and  Cohen's  formulas  give 
B  values  to  better  than  10%  for  all  materials  tabulated  in  Table  III.  The  B  in 
Eq.  (91)  scales  as  l/d^  while  in  VFF  it  scales  as  \/d*. 


10.  Tight-Binding  Theory 

In  the  semi-empirical  tight-binding  (TB)  approach,  the  total  energy  of  a 
semiconductor  crystal  is  assumed  to  be  the  sum  of  the  electron  energies  E^(k) 
in  the  valence  bands  plus  repulsive  pair  energies  Ujj  between  the  nearest- 
neighbor  atoms  (Chadi,  1978): 

£,  =  £,.,+ =  +  (92) 

V  k  i>  J 

Furthermore,  the  band  energies  are  interpolated  by  using  a  TB  Hamilto¬ 
nian  that  contains  term  values  of  the  atoms,  and  a  handful  of  interaction 
parameters  between  orbitals  of  the  neighboring  atoms.  Despite  the  simplicity 
of  Eq.  (92),  recent  first-principles  theories  have  given  some  support  to  this 
approximation. 
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One  virtue  of  the  TB  approach  over  valence  force-field  models  is  that  it  is  a 
quantum  theory  without  much  complication.  As  compared  to  first-principles 
theory,  the  TB  approach  is  easier  to  execute,  particularly  when  applied  to 
complicated  systems  such  as  alloys  and  superlatices.  In  actual  applications, 
the  TB  calculation  either  is  carried  out  using  the  full  band-stru  iure 
calculation  (BS),  or  is  approximated  by  simpler  local  theory  such  as 
Harrison’s  (1980,  1983a,  and  1983b)  bond  orbital  model  (BOM). 


a.  Band  Structure  Calculations 

The  simplest  TB  Hamiltonian  contains  the  s  and  p  atomic  term  values  c, 
and  £p  for  both  cations  and  anions,  and  the  nearest-neighbor  two-center 
interactions  V„„,  and  Vpp,.  To  be  more  explicit,  the  8  x  8 

/c-dependent  Hamiltonian  contains  the  term  values  as  the  diagonal  matrix 
elements,  while  the  off-diagonal  matrix  elements  between  the  cation  and 
anion  orbitals  are  given  by 

=  'MJ),  (93) 

i 

where  the  sum  runs  over  the  four  first  neighbor  atoms  specified  by  the  bond 
displacements  d.  The  ys  are  the  orbital  indices  for  s,  p„  Py,  and  p..  The  hy^. 
values  are  related  to  the  two-center  K’s  by  the  Slater-Koster  (1954)  relations: 


=  V 

(94) 

=  ‘^iKpc, 

(95) 

=  +  (J  - 

(96) 

(97) 

where  a,  =  xjd  are  the  direction  cosines  of  d  and  the  K’s  depend  only  on  the 
length  d. 

Once  the  values  of  these  TB  parameters  and  their  dependences  on  the  bond 
length  are  known,  the  Hamiltonian  at  each  k  inside  the  Brillouin  zone  (BZ) 
can  be  evaluated,  and  the  summation  of  k  carried  out  to  obtain  the  band- 
structure  energy,  which,  when  added  to  the  repulsive  energy,  gives  the  total 
energy  of  any  specified  geometry.  All  the  elastic  constants,  associated  internal 
displacements,  and  transverse  optical  phonon  frequencies,  are  easily  calcu¬ 
lable  following  the  procedure  of  Section  2.  The  only  point  to  note  is  the  k- 
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sum,  which  without  strain  can  be  calculated  accurately  by  using  the  10  spe¬ 
cial  k  points  (Chadi  and  Cohen,  1973)  in  the  irreducible  wedge  of  the  BZ. 
Under  strain,  the  crystal  symmetry  changes;  one  needs  to  extend  these  special 
k  points  to  other  nonequivalent  wedges.  However,  since  the  sum  of  the 
valence-band  energies  as  a  function  of  /c  is  a  rather  smooth  function,  a 
uniform  sampling  over  the  whole  BZ  converges  very  quickly.  A  5  x  5  x  5 
grid  is  sufficiently  accurate  for  the  required  calculations.  To  avoid  the 
numerical  inaccuracy  inherent  in  direct  energy  subtractions,  one  can  also 
calculate  the  second  derivatives  directly  by  using  perturbation  theory. 

Perturbation  theory  starts  with  the  expansion  of  the  fc-dependent  Hamilto¬ 
nian  H  in  powers  of  the  infinitesimal  strain  parameter  e,  keeping  terms  up  to 
second-order; 


Hik)  =  Ho  +  H,e  +  ^H2e\ 


(98) 


where  Hq  is  the  strain-free  Hamiltonian,  and  Hi  and  //j  are,  respectively,  the 
first  and  second  derivatives  with  respect  to  e  evaluated  at  e  =  0.  The  oand 
energy  contribution  to  the  strain  coefficient  then  comes  from  the  second 
derivative  of  with  respect  to  e,  denoted  by 


de^ 


V  k  V  t  k 


\(vklHi\ck)l^ 

e^{k)-B^(k) 


(99) 


where  e^ik)  and  \ck}  are,  respectively,  the  eigen  energies  and  eigenvectors  of 
Ho  for  the  conduction  bands,  and  vk  stands  for  the  valence  bands.  Note  that 
the  inter-valence-band  contributions  in  the  second-order  perturbation  sum 
cancel  exactly  so  they  are  not  needed  in  Eq.  (99).  To  evaluate  these  matrix 
elements  one  needs  to  have  the  first  and  the  second  strain  derivatives  of  the 
two-center  interactions  and  the  direction  cosines  a^.  For  the  strain  para¬ 
meters  e  defined  in  Section  2,  and  for  the  two  center  interactions  V  that  scale 
as  1  Id",  the  following  results  are  useful. 

(i)  For  the  bulk  modulus,  the  direction  cosines  do  not  change,  and  we  have 

dVjde  =  -nV  and  =  n{n  -I-  1)K 


(ii)  For  Cl  1  —  Cl  2  with  e  specified  in  Eq.  (9),  we  get 

dtxjde  =  cii(5ii  -  612),  d^ajde^  =  aiiXSn  +  612)  -  4]/3,  dV/de  =  0, 
and 

c'^Vde^  =  AnVjl). 
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(iii)  For  C44  with  the  strain  e  given  in  Eq.  (11)  and  an  internal  displace¬ 
ment  u,  we  find 

dVjde  =  — naja^K  dVjdu  =  noL^v/d, 
doL-Jde  —  +  <5,-2ai)/2  —  a.-aja,, 

dn-J^u  =  +  ctiCLyld,  d^V/de^  =  n(n  —  1)179, 

d^V/du^  =  n(n  -  l)V/(3d^), 
d^oLjce^  =  -a,(6a  -i-  5.-2)/12, 
d^VI{dedu)  =  —n{n  -f  2)ctiCi20Ly/d, 

and  finally 

c^!xj{dedu)  =  ((J.iaiaj  +  Si2!X^o^2  +  2^,-3aia2  —  6a,aia2a3)/(2i). 

Also  note  that  three  second  derivatives  of  the  band-structure  energy  are 
needed,  namely  d^Ey,Jde^,  c^E^^J{decu),  and  d^E^Jdu^  for  the  evaluation  of 
D,  and  <f>  of  Eq.  (13),  respectively. 

b.  Bond  Orbital  Model 

Harrison’s  (1980,  1983a,  and  1983b)  bond  orbital  model  (BOM)  empha¬ 
sizes  calculations  of  the  TB  total  energy  in  terms  of  local  energies.  One  special 
feature  of  the  BOM  is  its  universality.  Another  feature  is  that  its  simple  and 
often  analytical  forms  provide  direct  insight  into  the  essential  physics. 
Although  BOM  aims  at  predicting  trends,  it  is  reasonably  accurate  in  many 
cases. 

The  band-structure  energy,  or  the  center  of  gravity  of  the  valence  band,  in 
BOM  is  computed  in  the  following  steps.  The  terms  involved  are  indicated  in 
Fig.  2. 

(1)  Construct  the  sp^  hybrid  orbitals  \h}  for  each  atom;  these  hybrid 
orbitals  are  directed  toward  the  neighboring  atoms.  For  example,  the  hybrid 
in  the  [1 1 1]  direction  is  given  by 

\h}  =  (ls>  |p,>  -h  \p,}  +  |p,»/2.  (100) 

The  hybrid  energy  is  then  given  by  =  <fi|//i/2>  =  (s,  +  3£p)/4.  In  the 
zincblende  structure,  the  cation  hybrid  energy  is  in  general  different  from 
the  anion  hybrid  energy  Eh  -  Note  that  the  two  hybrid  orbitals  of  the  same 
atom  but  in  two  different  directions  are  now  coupled  by  the  so-called  metallic 
energy  =  <^h\H\h'y  =  (e^  -  £p)/4. 

(2)  Construct  the  bonding  and  antibonding  molecular  orbitals,  |h>  and 
|a>,  from  the  two  hybrid  orbitals  \h^y  and  \h^y  directed  toward  each  other 
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Fig.  2.  Labels  of  the  interactions  between  hybrids  associated  with  an  adjacent  anion-cation 
pair.  is  the  interaction  between  two  hybrids  on  the  same  anion  (cation);  the  “covalent 

energy”  Vj  is  the  interaction  between  anion  and  cation  hybrids  that  point  toward  one  another 
along  the  bond  direction;  V]  is  the  interaction  between  an  anion  hybrid  in  one  direction  and  an 
adjacent  cation  hybrid  pointing  in  a  different  direction.  The  “ionic  energy”  Kj  is  half  the 
difference  betwen  cation  and  anion  term  values.  The  lower  segment  of  the  figure  depicts  the 
splitting  of  the  hybrid  energy  levels  by  the  Fj  and  Fj  interactions. 


along  the  same  bond  by  diagonalizing  a  2  x  2  matrix  with  and  on  the 
diagonal,  and  =  </i'|H|fi''>  as  the  off-diagonal  matrix  elements.  The 

resulting  energies  for  |6>  and  |a>  are  Cb  =  + 

Jy\  +  yi,  respectively,  where  =  {el  +  £b)/2  is  the  mean  hybrid  energy, 
1^2  is  the  covalent  energy,  and  V'a  *=  (4  —  Ch)/2  is  the  polar  energy.  The  eigen 
states  can  also  be  written  explicitly  in  terms  of  these  energies: 

|i.>  =  V(1  +  X,y2\h''}  +  V(1  -  ap)/2|A‘>,  (101) 

|a>  =  -  V(1  -  ap)/2|(i->  +  V(1  +  a,)/2|).‘>,  (102) 
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where  Kp  is  called  the  polarity  and  is  defined  as 

ap  =  (103) 

(3)  The  quantity  would  be  the  center  of  gravity  of  the  valence  bands,  if 
interactions  between  states  on  different  bonds  were  neglected.  Harrison 
(1983b)  incorporated  these  interactions  in  a  perturbation  theory  in  which  the 
change  of  the  bonding  energy  is  given  by 

A£,  =  Xl<6|H|a'>|V(£b-£.).  (104) 

a' 

where  the  sum  runs  over  the  antibonding  states  of  the  surrounding  bonds. 
Note  that  the  interactions  among  the  bonding  states  lead  to  the  formation  of 
the  valence  bands,  but  do  not  shift  their  center  of  gravity;  therefore,  these 
interactions  need  not  to  be  considered  in  the  total  energy  calculation. 
Including  the  energy  correction  Ae^,,  the  final  band-structure  energy  per  bond 
(which  contains  two  electrons)  is  given  by 


£b  =  2€b-h2A£b.  (105) 

Harrison  (1980)  referred  to  the  second  term  as  the  metallization  energy. 
Besides  providing  a  simple  means  for  evaluating  the  center  of  gravity  of  the 
valence  electron,  Harrison’s  BOM  also  provides  a  set  of  universal  TB 
parameters.  Based  on  comparison  with  the  free-electron  band  width  (Froyen 
and  Harrison,  1979)  and  with  empirical  TB  parameters,  Harrison  (1983b) 
deduced  the  following  set  of  universal  two-center  interactions; 

(106) 

with  =  - 1.32,  =  1.42,  =  2.22,  and  =  -0.63,  where  m  is  the 

free-electron  mass  and  d  the  bond  length.  In  units  where  d  is  in  A  and  V  in  eV, 
=  7.62  The  pair-repulsive  energy  u  in  the  BOM  is  taken  as 

resulting  from  the  overlap  of  wavefunctions  of  the  orbitals  on  the  two  centers 
and  was  shown  to  have  the  form 


u  =  uMd)\  (107) 

where  is  the  equilibrium  bond  length.  The  value  of  Uq  is  determined  by 
requiring  that  dg  is  the  experimental  value.  Note  that  the  d  dependences  of 
both  and  u  are  taken  to  be  the  proper  scaling  not  only  among  different 
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systems,  but  also  within  the  same  system,  as  the  bond  length  varies  under 
distortions. 

Under  these  assumptions,  the  bond  energy  £bond.  which  is  defined  as  the 
difference  between  the  energy  per  bond  in  a  semiconductor  and  the  average 
energy  per  two  electrons  in  the  free  atoms,  i.e.,  £bond  =  +  “o  ~  2£,  takes 

the  following  simple  form  for  a  nonpolar  semiconductor  (Harrison  1983a): 

n(l -«m  +  9ajyi6).  (108) 

where  a„  is  called  the  metallicity  and  is  defined  as  For  a  polar 

semiconductor,  £bond  becomes  slightly  more  complicated: 

E^  =  2l,-2l-2iVl+Vl)^'^ 


»'!)  ■ 


(109) 


where  is  called  the  covalency,  and  V,c  and  are  the  metallic 

energies  for  the  cation  and  the  anion,  respectively.  The  bulk  modulus  also 
takes  a  very  simple  form;  for  a  group-IV  semiconductor,  it  reads 


B  =  -2Kj(l  -  9ai/16)/(73  P),  (1 10) 


and  for  a  polar  semiconductor,  it  becomes 


B=  -2V, 


at 


-8"' 


l(5«i  -  +  V\^)l(Vl  +  K|) 


(111) 


These  expressions  show  that  the  bulk  modulus  varies  as  in  the  pure 
covalent  case,  and  as  in  the  extreme  ionic  limit  Kj  >  Fj.  Note  that  this 
result  is  different  from  the  dependence  in  Cohen’s  (1985)  formula  and 
the  \ld*  scale  in  VFF. 

Shear  strains  cause  a  semiconductor  to  shift  away  from  perfect  tetrahedral 
symmetry.  To  deal  with  the  shear  elastic  coefficient,  the  BOM  has  to  be 
modified.  A  simple  approximation  is  the  rigid  hybrid  model  (Harrison, 
1983b;  van  Schilfgaarde  and  Sher,  1987a  and  b),  in  which  the  hybrid  orbitals 
of  each  atom  are  assumed  to  remain  in  their  original  tetrahedral  directions 
despite  the  lattice  distortion.  Then  the  hybrids  of  two  nearest-neighbor  atoms 
making  up  the  bonding  and  antibonding  states  no  longer  are  directed  toward 
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each  other,  as  shown  in  Fig.  3.  There  is  a  misalignment  angle  6  between  each 
hybrid  and  the  line  connecting  the  two  atoms,  and  the  convalent  energy  Fj  is 
given  by 

j  2v/3  cos  0  Kp.  -  cos^  0  +  3(1  -  cos'  .  (112) 

The  lowest-order  change  3V2,  caused  by  an  infinitesimal  angular  misalign¬ 
ment  SO,  is  then  given  by 

=  +  (113) 

Under  the  strain  e  described  in  Eq.  (9)  for  Cjj  —  Cjj ,  there  is  no  bond  length 
change,  and  (SO)^  =  le^l2>.  If  one  assumes  that  the  metallization  coupling  is 
only  through  the  metallic  energies  and  as  has  been  assumed  so  far, 
then  the  change  of  the  crystal  energy  is  the  change  of  the  band-structure 
energy  due  to  31^2-  Then,  according  to  Eqs.  (10)  and  (105), 

C  -C  =  — —  3K,/c' 

=  ^«.(v/3K„  +  3V;„-3V)  (114) 

X  [1  +  (j  -  +  y\Mvi  +  I'DJ. 

Harrison  (1983b)  has  pointed  out,  however,  that  in  addition  to  Kj,  other 
interactions  such  as  V\  shown  in  Fig.  2  produce  important  contributions  to 
the  shear  elastic  constant.  By  arguing  that  these  other  contributions  must 
cancel  those  associated  with  the  change  5V2  arising  from  the  metallization 
energy  in  a  rigid  rotation,  Harrison  deduced  the  following  expression; 

c„  -  c.,  =  4  3K,„  -  3K,J.  (115) 

Under  the  strain  e  for  the  C44  given  in  Eq.  (11)  and  with  an  internal 
displacement  given  by  u  =  rfd/y/3  as  described  in  Section  2,  the  bond 
misalignment  angles  for  the  four  bonds  have  the  same  magnitude  with 
(30)-  =  2(r}  -I-  e/2)^/9.  The  four  bond  lengths  also  change,  with  the  change  for 
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Fig.  3.  A  schematic  picture  of  the  hybrids,  treated  as  rigid,  in  a  shear  distortion  leading  to 
1  ~  ^12' 

one  pair  given  by  =  Sr2  =  S  +  e  and  by  =  —d-¥e  for  the  other 

pair,  where  <3  =  (e  —  t\)dl2>  and  e  =  (ri  +  e!2fdl9.  If  again  one  assumes  that 
the  metallization  is  only  through  ,  then  the  strain  energy  density  can  be 
shown  to  be  given  by 

U  =  9BS^I{2d^)  +  3(Cii  -  C,2)iW  n  16^ 

=  Bie  -  rjy/2  +  (C„  -  +  e/2)V3.  ^  ^ 

For  a  given  strain  e,  U  can  be  minimized  with  respect  to  rj,  which  yields  the 
Kleiman  displacement  parameter  C  =  with  C  given  by 

C  =  (B-C/3)/(B  +  2C/3),  (117) 

where  C  =  -  Ci2-  Finally,  from  U  =  C^^e^l2,  the  following  relation¬ 

ship^  is  established : 


9/C*^  =  6/C  +  4/B, 


(118) 


or 


/bom  =  9BC/CC44  (6B  +  4C)]  =  1.  (1 19) 

''  This  is  an  expression  corrected  from  one  published  previously  (van  Schilfgaarde  and  Sher, 
1987a  and  b). 
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If  one  includes  the  effect  of  the  energy  density  will  involve  an  additional 
term  that  couples  Sr  and  SO.  Then  the  analysis  is  no  longer  simple. 

The  preceding  explicit  formulas  for  the  elastic  constants  in  BOM  are  not 
much  more  complicated  than  the  valence  force-field  model.  However,  they 
relate  macroscopic  forces  to  intrinsic  atomic  interactions.  It  is  interesting  to 
note  that  the  simple  identity  relation  of  Eq.  (119)  holds  very  well.  As  can  be 
seen  in  Table  III,  this  result  is  certainly  better  than  the  of  Eq.  (78),  and  is 
very  competitive  to  Martin’s  identity  7^,  which  requires  the  inclusion  of  his 
particular  treatment  of  Coulomb  forces.  Because  the  Coulomb  energy  is  not 
included  explicitly  in  Eq.  (119),  its  contribution  to  the  elastic  constants  is 
probably  small. 


c.  Numerical  Results  and  Quantitative  Applications 

To  study  the  quantitative  aspect  of  the  theory,  one  first  needs  to  establish 
the  TB  parameters.  Table  IV  lists  the  term  values  we  will  use.  The  values  of 
the  outermost  valence  levels  are  taken  to  be  minus  the  experimental  first 
ionization  energies  listed  in  Kittel’s  (1986)  book,  and  the  other  term  values 
are  deduced  from  calculated  extraction  or  promotion  energies  using  norm- 
conserved  atomic  pseudopotentials  (Bachelet  et  ai,  1982).  These  term  values 
are  very  similar  to  Mann’s  (1967)  Hartree-Fock  calculations  used  by- 
Harrison  (1980),  which  are  also  given  in  Table  IV.  The  major  difference  in  the 
two  sets  occurs  in  heavy  elements,  where  relativistic  s-shifts  are  important, 
but  were  not  included  in  Mann’s  results. 

Table  V  lists  the  values  of  bond  lengths,  bond  energies  Eboncn  elastic 
coefficients  B,  C  =  —  Cjj,  and  C4.4,  and  the  zone-center  TO  phonon 

frequencies  cu  for  a  selected  group  of  systems  to  be  examined  in  the  remainder 
of  this  section.  Experimental  values  are  also  presented,  with  the  exception  of 
extrapolations  for  the  elastic  constants  of  AIP  and  AIAs.  Table  VI  compares 
results  between  the  BOM  and  the  full  band-structure  (BS)  calculation  using 
Harrison’s  universal  TB  parameters.  Except  for  the  Ge  and  HgTe  values,  the 
agreement  between  the  two  calculations  for  E^^j  is  within  10%  or  better.  The 
calculated  Ebond  values  are  also  in  fair  agreement  with  the  experimental 
values  except  for  diamond.  Since  diamond  has  a  much  smaller  bond  length 
than  the  rest  of  the  systems,  this  discrepancy  is  an  indication  of  a  limit  to  the 
scaling  rules  for  both  and  u.  Although  the  trends  for  the  bulk  moduli  from 
both  calculations  are  similar,  the  caiculational  errors  in  the  BOM  can  be  as 
large  as  50%.  Also  note  that  the  calculated  values  of  B  for  most  systems  are 
only  about  one-half  of  the  experimental  values.  The  tabulated  values  of  C  and 
C44  for  BOM  are  based  on  Eqs.  (115)  and  (118),  respectively.  Considering  the 
simplicity  of  these  formulas,  the  agreement  with  the  band  structure  calcula- 
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TABLE  IV 


TERMVALUESUSED1NTHEPRESENTCALCULATIONSANDMaNN'S(1967)HaRTREE- 
Fock  values  as  used  by  Harrison  (1980). 


Element 

Present 

Mann 

Cu 

-7.72 

-2.96 

-7.72 

-2.37 

Ag 

-7.57 

-3.54 

-7.06 

-2.61 

Au 

—  9.22 

-3.91 

-6.98 

-2.67 

Be 

-9.32 

-5.41 

-8.41 

-5.79 

Mg 

-7.62 

-2.97 

-6.88 

-3.84 

Zn 

-9.39 

-4.09 

-7.96 

-4.02 

Cd 

-8.99 

-4.17 

-7.21 

-3.99 

Hg 

- 10.43 

-4.35 

-7.10 

-3.95 

B 

-14.00 

-8.30 

-13.46 

-8.43 

Al 

-11.78 

-5.98 

- 10.70 

-5.71 

Ga 

-13.23 

-5.90 

-11.55 

-5.67 

In 

- 12.03 

-5.56 

-10.14 

-5.37 

C 

-19.81 

-11.26 

-19.37 

-11.07 

Si 

- 15.03 

-8.15 

- 14.79 

-7.58 

Ge 

-16.40 

-7.75 

-15.15 

-7.33 

Sn 

- 14.53 

-7.03 

-13.04 

-6.76 

Pb 

-15.25 

-6.45 

- 12.48 

-6.53 

N 

-26.08 

-14.54 

-26.22 

-13.84 

P 

-19.62 

- 10.57 

-19.22 

-9.54 

As 

-20.02 

-9.93 

-18.91 

-8.98 

Sb 

-17.56 

-8.77 

-16.02 

-8.14 

O 

-28.55 

-13.561 

-34.02 

-16.72 

S 

-21.16 

- 10.39 

-24.01 

-11.60 

Se 

-2141 

-9.90 

-22.86 

-10.68 

Te 

-19.12 

-9.32 

-19.12 

-9.54 

F 

-36.23 

-17.44 

-42.78 

- 19.86 

Cl 

-25.81 

-13.05 

-29.19 

-13.78 

Br 

-24.95 

-12.01 

-27.00 

-12.43 

I 

-21.95 

-10.79 

-22.34 

-10.97 

tions  is  remarkable.  However,  the  overall  calculated  values  for  these  two 
shear  coefficients  are  also  consistently  smaller  than  the  experimental  values. 

The  above  comparisons  show  that  the  BOM  and  BS  calculations  predict 
similar  qualitative  trends  for  the  binding  energies  and  the  elastic  constants.  In 
this  regard,  the  BOM  has  the  advantage  of  providing  explicit  forms  to  show 
the  dependences  on  bond  lengths  and  polarities.  However,  the  merit  of  the 
TB  theory  over  the  valence-force  model  is  in  its  ability  to  incorporate  atomic 
quantities  to  mimic  quantum  mechanical  effects.  To  be  useful  for  specific 
material  science  applications  (for  example  in  alloy  surface  segregation,  see 
Patrick  et  ai,  1987  and  1988;  Sher  et  aU  1988),  the  theory  has  to  be  more 
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TABLE  V 


Experime?^al  properties  of  semiconductors* 


d 

■^bond 

B 

C 

0) 

c 

1.540 

-3.68 

44.227 

95.120 

57.740 

1332 

Si 

2.352 

-2.32 

9.923 

10.274 

8.036 

520 

Ge 

2.450 

-1.94 

7.653 

8.189 

6.816 

301 

AIP 

2.367 

-2.13 

8.600 

6.900 

6.150 

440 

AlAs 

2.451 

-1.89 

7.727 

7.160 

5.420 

361 

AlSb 

2.656 

-1.76 

5.817 

4.428 

4.076 

366 

GaP 

2.360 

-1.78 

9.143 

7.780 

7.143 

367 

GaAs 

2.448 

-1.63 

7.690 

6.630 

6.040 

269 

GaSb 

2.640 

-1.48 

5.792 

4.946 

4.440 

231 

InP 

2.541 

-1.74 

1.241 

4.460 

4.600 

304 

InAs 

2.622 

-1.55 

5.794 

3.803 

3.959 

219 

InSb 

2.805 

-1.40 

4.831 

3.130 

3.132 

185 

ZnS 

2.342 

-1.59 

7.637 

3.990 

4.558 

279 

ZnSe 

2.454 

-1.29 

6.457 

3.560 

3.984 

213 

ZnTe 

2.637 

-1.20 

5.090 

3.060 

3.120 

177 

CdTe 

2.806 

-1.10 

4.210 

1.680 

2.040 

141 

HgTe 

2.798 

-0.81 

4.759 

1.817 

2.259 

116 

“Values  of  bond  length  d,  bond  energy  bulk  modulus  B,  and  shear  coeflSacnt 

C  =  Cn  —  C,2  used  to  determine  the  parameters  in  Tables  VII  through  XI.  Also  listed  are 
the  experimental  values  of  and  the  TO  optical  phonon  mode  to  at  f  to  be  compared  with 
the  calculations.  All  the  clastic  constants  arc  in  units  of  10"  dynes/cm*,  d  in  A,  in  cV, 
and  (jj  in  terms  of  wave  numbers  in  1/cm.  All  bond  lengths  are  deduced  from  the  lattice 
constants  quoted  by  Zallen  (1982).  The  values  of  Ebond  ^^c  taken  from  Harrison  (1980),  Table 
7-3,  except  for  AlSb,  ZnTc,  CdTe,  and  HgTe,  which  arc  deduced  from  the  Phillips  (1973) 
Table  8.2.  The  elastic  constants  are  taken  from  Table  III,  and  the  phonon  frequencies  arc 
taken  from  values  cor'piled  in  “Landolt-Borstein  Numcnca!  Data  and  Functional  Relation¬ 
ships  in  Science  and  Technology,"  New  Series,  edited  by  K.-H.  Helllwidgc,  Vols.  17  and  22. 


quantitative.  Successful  quantitative  application  of  the  TB  theory  has  been 
made  by  Chadi  (1978, 1979,  and  1984)  in  his  study  of  semiconductor  surfaces. 
The  comparisons  in  Table  VI  indicate  that  the  BOM  should  be  treated 
differently  from  the  BS  calculation  when  considered  for  quantitative  applica¬ 
tions.  If  one  wishes  to  calculate  the  properties  using  the  local  picture,  one 
should  use  the  BOM.  If  one  wants  to  carry  out  the  TB  Hamiltonian  precisely, 
one  needs  to  adopt  a  different  set  of  parameters  based  on  the  BS  calculation. 
For  this  reason,  we  shall  next  consider  quantitative  applications  of  BOM  and 
BS  calculations  separately. 

There  are  many  ways  to  parametrize  the  TB  theory.  Chadi  (1979,  1984) 
used  a  simple  form  for  the  repulsive  energy  u  =  a  +  b{d  —  do)  +  c(<^  ~ 
and  the  same  l/d^  scaling  for  the  TB  parameters  To  keep  the  theory  as 
close  to  Harrison’s  (1983a  and  1983b)  form  as  possible  but  free  it  from  the 
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TABLE  VI 


Comparison  of  BOM  and  BS  models-  predictions' 


J 

r 

bonds 

B 

C 

BS 

BOM 

BS 

BOM 

BS 

BOM 

BS 

BOM 

c 

-7.17 

-7.08 

49.31 

47.39 

67.15 

67.10 

44.45 

51.78 

Si 

-2.39 

-2.50 

4.65 

4.18 

7.19 

8.07 

5.10 

5.29 

Ge 

-  2.03 

-2.34 

2.89 

1.91 

4.72 

6.58 

3.69 

3.00 

AIP 

-2.59 

-2.54 

3.81 

4.21 

4.45 

4.83 

3.45 

4.10 

AlAs 

-2.20 

3.12 

3.66 

3.85 

4.07 

2.91 

3.50 

AlSb 

-1.64 

-1.62 

1.99 

lAl 

2.75 

3.12 

2.11 

2.54 

GaP 

-2.32 

-2.16 

3.63 

4.40 

4.48 

5.15 

3.69 

4.34 

GaAs 

-  1.99 

-1.85 

2.83 

3.76 

3.75 

4.30 

3.02 

3.66 

GaSb 

-1.53 

-1.43 

1.76 

2.45 

2.63 

3.40 

2.17 

2.65 

InP 

-2.21 

-2.06 

2.35 

2.91 

2.69 

2.93 

2.21 

2.63 

InAs 

-1.89 

-1.74 

1.93 

2.67 

2.38 

2.52 

1.90 

2.32 

InSb 

-1.43 

-1.28 

1.28 

1.97 

1.81 

2.17 

1.48 

1.8" 

ZnS 

-1,83 

-1.79 

346 

3.74 

3.53 

3.62 

2.71 

3.30 

ZnSe 

-  1.52 

-1.48 

2.67 

3.06 

2.75 

2.67 

2.08 

2.53 

ZnTe 

-1.15 

-1.05 

1.82 

2.16 

1.88 

1.87 

1.50 

1.78 

CdTe 

-1.06 

-0.97 

1.24 

1.47 

1.22 

1.14 

0.96 

1.13 

HgTe 

-0.72 

-0.49 

1.23 

1.72 

1.33 

1.31 

1.11 

1.30 

“Comparison  of  the  tight-binding  theory  using  the  full  band  structures  (BS)  and  the  bond 
orbital  model  (BOM  for  bond  energyes  bulk  moduli  B,  and  shear  coefficients  C  = 

Cj ,  —  Cl  -  and  C44.  All  energies  are  in  eV  and  elastic  constants  in  10"  dynbes/cm^. 


\/d‘  and  l/d‘^  scaling  rules  for  and  u,  respectively,  we  assume  the 
following  forms: 


=  yUdo/d)"  (120) 

and 

u^uoidjdr,  (121) 

where  the  superscript  and  subscript  0  indicate  the  values  evaluated  at  the 
equilibrium  bond  length  dQ.  For  simplicity,  the  values  of  are  taken  to  be 
Harrison's  values  given  in  Eq.  (106)  scaled  by  a  factor /: 

(122) 

Thus,  there  are  four  parameters  for  each  system:  the  scaling  parameter  /, 
the  powers  n  and  m,  and  the  value  Uq.  These  parameters  can  be  determined  by 
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requiring  that  the  model  produce  the  correct  experimental  values  for  £bond’ 
do,  Cji  -  Cl-  and  B.  Since  —  Cjj  is  only  governed  by  in  both 
BOM  and  Land  calculation,  it  alone  determines  the  scaling  factor/  Then  the 
bond  energy  can  be  used  to  determine  u^.  The  requirement  that  the  first 
derivative  of  E-^  be  zero  at  do  then  determines  the  ratio  of  the  powers  n/m, 
w  hich  couples  with  the  equation  for  the  bulk  modulus  to  yield  the  values  for  n 
and  m.  One  can  then  use  these  sets  of  parameters  to  check  the  validity  of  the 
model  by  calculating  other  quantities  not  employed  in  the  fitting,  e.g.,  C44, 
the  internal  displacement  parameter  C,  and  the  optical  phonon  frequencies  cu 
at  the  zone  center.  If  the  results  are  acceptable,  the  model  can  be  extended  to 
more  complicated  systems  such  as  alloys  and  superlattices  with  local 
environments  similar  to  the  bulk  crystals. 

Table  VII  shows  the  results  for /,  n,  m,  and  Uq  obtained  from  the  preceding 
fitting  procedure  by  using  the  full  band-structure  calculations,  and  the 
corresponding  values  of  C44,  and  (o  calculated  for  consistency  checks.  The 
scaling  factor / ranges  from  1  to  1.4  and  tends  to  decrease  with  an  increase  in 


TABLE  Vn 


Full  band  structure  calculation* 


/ 

n 

m 

«o 

C 

OJ 

c 

1.390 

2.840 

3.767 

21.924 

48.393 

0.121 

1,459 

Si 

1.326 

3.040 

5.001 

6.938 

8.013 

0.511 

572 

Ge 

1.388 

3.204 

5.278 

6.415 

6.841 

0.487 

342 

AlP 

1.294 

3.530 

5.598 

6.435 

5.827 

0.516 

447 

AIAs 

1.464 

3.524 

5.430 

7.089 

5.598 

0.459 

384 

AiSb 

1.337 

3.268 

5.668 

4.838 

3.944 

0.564 

354 

GaP 

1.395 

3.705 

5.683 

7.285 

6.857 

0.501 

382 

GaAs 

1.397 

3.633 

5.716 

6.530 

5.791 

0.500 

292 

GaSb 

1.431 

3.471 

5.717 

5.519 

4.515 

0.536 

256 

InP 

1.323 

4.240 

6.633 

5.603 

4.260 

0.584 

304 

InAs 

1.300 

3.997 

6A21 

4.962 

3.564 

0.552 

220 

InSb 

1.353 

3.773 

6.399 

4.350 

3.092 

0.602 

200 

ZnS 

1.062 

3.308 

5.996 

4.225 

3.727 

0.632 

325 

ZnSe 

1.134 

3.420 

5.994 

4.260 

3.164 

0.576 

233 

ZnTe 

1.284 

3.306 

5.828 

4.285 

2.813 

0.590 

205 

CdTe 

1.171 

3.656 

6.761 

3.092 

1.701 

0.694 

156 

HgTe 

1.173 

3.V  » 

7.074 

3.080 

2.040 

0.716 

152 

The  results  for  the  parameters  /,  n,  m,  and  obtained  from  the  fitting  of  the 
bond  energy,  bond  length,  bulk  modulus,  and  shear  coefficient  C,,  —  C,2  of  Table 
V  using  the  full  band  structure  calculation.  Also  listed  are  the  calculated  C^, 
internal  displacement  parameter  C.  tmd  the  TO  optical  phonon  mode  w  at  T.  All  the 
elastic  constants  are  in  units  of  10"  dynes/cm^  Uq  is  in  cV,  and  w  are  wave 
numbers  in  1/cm. 
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polarity.  In  the  power  dependence  of  oc  {do/d)”,  n  ranges  from  2.8  to  4.3, 
which  is  larger  than  the  n  =  2  used  in  Harrison’s  universal  TB  parameters. 
For  the  repulsive  pair  energy  u  =  Uoido/d)'”,  the  power  m  ranges  from  3.8  to 
6.8.  The  ratio  m/n  falls  in  the  range  from  1.3  to  1.9,  which  is  smaller  than  the 
m/n  =  2  used  by  Harrison.  The  calculated  values  of  C4.4  for  most  systems 
agree  with  the  experimental  data  to  10%  or  better,  except  for  diamond  and 
ZnS.  Note  that  the  experimental  data  for  ZnS  are  rather  dispersed.  The 
calculated  TO  optical  phonon  modes  at  T  in  1/cm  for  most  group  IV  and  III- 
V  systems  also  agree  with  experiments  to  10%  or  better.  The  discrepancies  for 
the  1 1- VI  systems  are  larger,  about  15%.  Reliable  results  for  C  from  both 
experiments  and  first-principles  calculations  are  available  only  for  a  limited 
number  of  systems.  The  calculated  C  in  the  TB  model  agrees  very  well  with 
those  results,  as  shown  in  Table  I.  The  overall  results  for  C44,  C,  and  co  in  the 
TB  calculations  are  equivalent  to  those  based  on  the  valence  force  model, 
including  Martin’s  Coulomb  force  corrections.  By  construction,  the  TB 
model  also  produces  the  correct  cohesive  energies,  bond  lengths,  bulk  moduli 
and  shear  coefficient  Cu  —  Cjj,  because  these  quantities  are  used  to  fit  the 
parameters. 

The  results  in  Table  VII  are  based  on  the  term  values  given  in  Table  IV  and 
the  TB  parameters  scaled  from  Harrison’s  universal  parameters.  It  is  useful  to 
know  how  the  predictions  are  influenced  by  these  parameters  and  the  fitting 
procedure.  Table  VIII  shows  the  results  based  on  Chadi  procedure  in  which 
the  TB  matrix  elements  !%•  are  scaled  as  l/d^,  and  the  repulsive  pair  energy  is 
taken  to  be  u  =  Uq  +  u^id  —  do)  +  Ujfd  —  do)^-  The  parameter  Uq  is  set  to 
produce  the  correct  bond  energy,  u,  is  determined  by  requiring  the  correct 
equilibrium  bond  length,  and  Uj  is  fixed  by  the  bulk  modulus.  Two  sets  of  TB 
parameters  are  tabulated  for  each  system:  One  is  the  set  used  by  Chadi  (1978, 
1979,  and  1984),  and  the  other  is  the  set  obtained  by  multiplying  Harrison’s 
by  scaling  factor  /listed  in  Table  VIII.  For  convenient  comparison,  the 
zero  of  the  term  values  is  set  equal  to  the  anion  s  energy.  Despite  considerable 
differences  in  these  two  sets  of  TB  parameters,  the  results  of  the  predictions 
from  both  sets  are  very  similar  and  also  very  similar  to  those  predicted  from 
the  other  procedure  used  for  the  results  in  Table  VII.  The  only  noticeable 
difference  between  the  predictions  in  Table  VIII  and  Table  VII  is  that  the 
present  procedure  produces  larger  phonon  frequencies  and  slightly  smaller 
C44  values.  We  also  note  that  the  fitted  parameters  Uq,  Uj,  and  Uj  ^^d  the 
predicted  values  for  Chadi ’s  elastic  constant  set  in  Table  VIII  are  not  the 
same  as  Chadi ’s  published  (1979)  values;  these  give  bulk  moduli  about  20% 
smaller  than  the  experimental  values,  but  give  considerably  better  phonon 
frequencies. 

To  parametrize  the  BOM,  several  different  stages  of  approximations  can 
be  made.  However,  for  a  general  application,  the  full  BOM  steps  presented  in 
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TABLE  VIII 

Two  SETS  OF  TB  PARAMETERS  * 


SI 


4 

y.^ 

y,p, 

wP** 

Chadi 

0.0 

7.20 

0.0 

-2.03 

2.55 

4.55 

-1.09 

Present 

0.0 

6.88 

0.0 

6.88 

-2.41 

2.59 

-1.15 

“o 

C 

c 

w 

Chadi 

7.29 

-9.98 

23.90 

BE! 

0.49 

620 

Present 

6.93 

-9.70 

23.42 

D 

li 

0.51 

592 

GaAs 

f/AC 

y^.i. 

y.^ 

Chadi 

0.0 

9.64 

5.12 

11.56 

-1.70 

2.40 

1.90 

3.44 

-0.89 

Present 

0.0 

10.09 

6.79 

14.12 

-2.34 

2.52 

2.52 

3.94 

-1.12 

“o 

“2 

C 

c 

<jj 

RHH 

6.36 

5.60 

8.77 

0.54 

339 

W^m 

6.63 

5.70 

8.53 

0.54 

322 

‘Comparison  between  the  two  different  sets  of  TB  parameters  described  in  the  text,  the  resultant 
expression  coefficients  u^,,  u , ,  U2  of  the  repulsive  pair  energy  u,  and  the  predicted  elastic  constants, 
Kleinmann  internal  displacement  parameters  Q,  and  phonon  frequency  cu  from  Chadi  fitting 
scheme. 


Eqs.  (100)  to  (105)  for  calculating  should  be  followed,  regardless  of 
approximations.  The  simplest  model,  referred  to  as  BOM(l),  is  to  include 
only  and  Fjc  in  the  matrix  element  for  the  calculation  of  the 

metallization  energy  in  Eq.  (104).  The  next  approximation,  BOM(2),  is  to 
include  V]  as  well.  Finally,  one  can  include  all  the  first-neighbor  interatomic 
TB  parameters  in  <blHla'>;  this  will  extend  the  |a'>  to  those  belonging  to  the 
second-neighbor  bonds.  This  last  approximation  will  be  referred  to  as 
BOM(3). 

Table  IX  shows  the  results  for  BOM(l)  following  the  parameterization 
procedure  described  in  Eqs.  (120)  to  (122).  The  httted  parameters  f,  n,  m,  and 
Uq  are  substantially  different  from  those  based  on  the  BS  calculations.  The 
predicted  for  the  group  IV  and  III-V  systems  are  slightly  larger  than  the 
experimental  values,  but  good  for  the  II- VI  systems.  The  calculated  C  values, 
although  not  all  smaller  than  those  in  Table  VII,  are  smaller  on  the  average. 
The  predicted  phonon  frequencies  are  too  high. 

The  parameters  and  the  predicted  results  from  BOM(2)  are  listed  in 
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TABLE  IX 


BOM(l)  Calculations* 


/ 

n 

m 

“0 

c** 

(Jj 

c 

1.440 

2.896 

22.345 

61.098 

2,103 

Si 

1,356 

3.208 

5.166 

7.023 

9.729 

Ge 

1.395 

3.666 

5.854 

6.383 

8.240 

0.506 

416 

AlP 

1.047 

3.334 

5.842 

4.418 

6.627 

0.460 

571 

AlAs 

1.264 

3.530 

5.685 

5.538 

6.668 

0.431 

491 

AlSb 

1.179 

3.170 

5.831 

3.818 

4.482 

0.511 

411 

GaP 

1.154 

3.439 

5.744 

5.239 

7.500 

0.453 

496 

GaAs 

1.179 

3.337 

5.647 

4.863 

6.340 

0.456 

358 

GaSb 

1.274 

3.354 

5.797 

4.456 

4.950 

0.500 

302 

InP 

0.999 

3.579 

6.490 

3.413 

4.558 

0.522 

388 

InAs 

0.995 

3.167 

5.960 

3.081 

3.787 

0.496 

260 

InSb 

1.126 

3.228 

6.105 

3.091 

3.267 

0.543 

224 

ZnS 

0.692 

2.750 

7.262 

1.642 

4.312 

0.580 

401 

ZnSe 

0.750 

2.823 

6.839 

1.821 

3.685 

0.544 

287 

ZnTe 

0.888 

2.715 

6.484 

1.987 

3.156 

0.531 

248 

CdTe 

0.734 

2.720 

8.394 

1.018 

1.949 

0.660 

189 

HgTe 

0.732 

2.202 

8.810 

0.842 

2.140 

0.677 

177 

The  results  for  the  parameters  /,  n,  m,  and  Uq  obtained  from  the  fitting  of  the  bond  energy, 
bond  length,  bulk  modulus,  and  shear  coefficient  C,,  —  C12  of  Table  V  using  the  BOM(l) 
described  in  the  text.  Also  listed  are  the  calculated  C44,  internal  displacement  parameter  C> 
and  the  TO  optical  phonon  mode  w  at  T.  All  the  elastic  constants  are  in  units  of  10" 
dynes/cm^,  Uq  is  in  eV,  and  to  is  given  in  terms  of  wave  number  in  1/cm. 


Table  X.  These  parameters  more  closely  resemble  those  in  Table  VIII  than  do 
the  values  from  BOM(l).  However,  the  predicted  C44  values  are  still  too 
small,  and  the  C  values  are  too  large,  but  the  co  values  are  better  than  those 
from  BOM(l). 

Table  XI  contains  the  results  from  BOM(3);  these  approach  those  of  the 
BS  calculations.  In  comparison  with  Table  VII  and  the  experimental  values 
in  Table  V,  the  BOM(3)  does  well  for  co,  produces  slightly  smaller  C44,  and 
probably  slightly  larger  C  values. 

In  conclusion,  the  TB  method  is  a  reasonable  approach  to  the  static  elastic 
properties  of  semiconductors.  If  carried  out  rigorously,  the  TB  parameters  in 
Table  VII  will  provide  quantitative  results  for  superlattices,  alloys,  and 
possibly  surfaces  in  which  the  local  environments  are  similar  to  those  in  the 
bulk.  The  quantitative  predications  of  BOM  are  not  as  good  as  the  BS 
calculations,  but  are  still  reasonable.  The  fitted  parameters  given  in  Tables  IX 
to  XI  allow  different  stages  of  approximations  to  be  made  using  the  BOM. 
This  is  especially  useful  for  more  complicated  systems,  because  computation¬ 
ally  the  BOM  is  about  two  orders  faster  than  the  band  structure  calculations. 
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TABLE  X 


BOM(2)  Calculations* 


/ 

n 

m 

“o 

Cm 

c 

c 

1.440 

2.896 

3.809 

21345 

55.161 

■!Bf 

1672 

Si 

1.356 

3.208 

5.166 

7.023 

7.520 

597 

Ge 

1.395 

3.666 

5.854 

6.383 

6.106 

0.720 

358 

AlP 

1.283 

3.440 

5.493 

6.171 

5.094 

0.677 

511 

AlAs 

1.472 

3.543 

5.419 

6.979 

5.103 

0.640 

440 

AlSb 

1.342 

3.249 

5.580 

4.771 

3.256 

0.706 

373 

GaP 

1.371 

3.540 

5.490 

6.901 

5.916 

0.688 

428 

GaAs 

1.387 

3.455 

5.452 

6.306 

4.864 

0.681 

317 

GaSb 

1.414 

3.398 

5.593 

5.305 

3.681 

0.717 

264 

InP 

1.307 

3.846 

6.115 

5.339 

3.446 

0.744 

354 

InAs 

1.298 

•3.528 

5.731 

4.834 

1806 

0.706 

243 

InSb 

1.342 

3.464 

5.910 

4204 

2.351 

0.747 

207 

ZnS 

1.062 

3.102 

5.728 

4.075 

3.016 

0.726 

373 

ZnSe 

1.141 

3.150 

5.587 

4.181 

1577 

0.688 

271 

ZnTe 

1.283 

3.067 

5.498 

4.140 

1270 

0.704 

231 

CdTe 

1.178 

3.271 

6.216 

3.011 

1.319 

0.764 

177 

HgTc 

1.169 

3.046 

6.116 

1916 

1.457 

0.793 

162 

'The  results  for  the  parameters  /,  n,  m,  and  «o  obtained  from  the  fitting  of  the  bond  energy, 
bond  length,  bulk  modulus,  and  shear  coefficient  C,,  —  of  Table  V  using  the  BOM(2) 
described  in  the  text.  Also  listed  are  the  calculated  Cm,  internal  displacement  parameter  (, 
and  the  TO  optical  phonon  model  o)  at  F.  All  the  elastic  constants  are  in  units  of  10“ 
dynes/cm^,  Uq  is  in  eV,  and  <o  is  wave  number  in  1/cm. 


11.  Semiconductor  Alloys 

The  systems  to  be  considered  in  this  section  are  alloys  of  the  diamond  and 
zincblende  semiconductors,  both  the  ordered  and  disordered  alloys.  The 
ordered  alloys  include  binary  compounds  such  as  SiC,  and  ternary  com¬ 
pounds,  such  as  GalnAsj,  in  three  crystal  structures  of  the  types  CuAuI, 
chalcopyrite,  and  CuPt  as  shown  in  Fig. 4  (Bernard  eta/.,  1988).  The 
disordered  alloys  include  binary  solutions  such  as  Sij-.Ge,  and  pseudobin¬ 
aries  such  as  Hgi_,Cd,Te  and  GaASj.^^bjt,  where  x  is  the  fractional 
concentration.  These  alloys  have  been  widely  used  and  studied;  however, 
detailed  information  about  their  elastic  constants  is  scarce  both  experimen¬ 
tally  and  theoretically.  One  reason  for  the  lack  of  rigorous  calculation  is  that 
the  elastic  constants  of  these  systems  are  more  complex;  existing  theories  are 
not  as  accurate,  particularly  for  disordered  alloys.  Another  reason  may  be 
attributed  to  the  fact  that  most  properties  of  these  alloys,  including  their 
elasticity,  were  thought  to  be  reasonably  well  approximated  by  the  concen- 
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TABLE  XI 


BOM(3)  Calculations* 


/ 

n 

m 

“0 

c 

O) 

c 

1.336 

2.901 

3.872 

20.814 

47.691 

0.135 

1,531 

Si 

1.262 

3.251 

5.346 

6.484 

7.472 

0.580 

562 

3e 

1.302 

3.886 

6.264 

5.942 

6.171 

0.658 

333 

AlP 

1.193 

3.43 

5.77 

5.722 

4.936 

0.541 

452 

AlAs 

1.369 

3.559 

5.507 

6.498 

4.812 

0.490 

387 

AlSb 

1.249 

3.253 

5.669 

4.402 

3.248 

0.619 

343 

GaP 

1.276 

3.550 

5.575 

6.422 

5.738 

0.542 

373 

GaAs 

1.290 

3.445 

5.511 

5.855 

4.724 

0.549 

281 

GaSb 

1.318 

3.415 

5.704 

4.923 

3.670 

0.621 

239 

InP 

1.210 

3.812 

6.153 

4.929 

3.398 

0.604 

304 

InAs 

1.199 

3.456 

5.724 

4.442 

2.755 

0.579 

216 

InSb 

1.245 

3.411 

5.943 

3.871 

2.367 

0.657 

188 

ZnS 

0.968 

3.062 

5.848 

3.618 

3.120 

0.630 

343 

ZnSe 

1.040 

3.097 

5.671 

3.727 

2.611 

0.580 

247 

ZnTe 

1.178 

3.019 

5.573 

3.371 

2.301 

0.601 

212 

CdTe 

1.065 

3.198 

6.368 

2.621 

1.403 

0.690 

165 

HgTe 

1.059 

2.937 

6.262 

2.523 

1.563 

0.730 

154 

The  results  for  the  parameters/  n,  m,  and  u^,  obtained  from  the  fitting  of  the  bond  energy, 
bond  length,  bulk  modulus,  and  shear  cocfBcient  C,,  —  C,2  of  Table  V  using  the  BOM(3) 
described  in  the  text.  Also  listed  are  the  calculated  C44,  internal  displacement  parameter  C. 
and  the  TO  optical  phonon  mode  w  at  F.  All  the  elastic  constants  are  in  units  of  10" 
dynes/cm^,  Uq  is  in  eV,  and  oj  is  wave  number  in  1/cm. 


tration-weighted  averages  of  their  constituents.  Because  of  the  rudimentary 
state  of  the  theory,  we  shall  deal  only  with  the  simplest  elastic  constant,  the 
bulk  modulus.  Our  focus  is  on  the  diflference  between  the  alloy  bulk  modulus 
B  and  the  concentration-weighted  averaged  value  E,  i.e.,  AB  —  B  —  B.  There 
are  several  fundamental  questions  that  can  be  addressed.  Is  AB  positive  or 
negative?  Do  the  sign  and  the  magnitude  of  AB  depend  on  the  state  of  order? 
How  can  we  calculate  the  bulk  modulus  in  a  disordered  system?  Analysis  of 
these  questions  constitutes  the  content  of  this  section. 


a.  Ordered  Alloys 

Long-range  ordering  has  been  found  to  exist  in  many  epitaxially  grown 
III-V  semiconductor  alloys  (Kuan  et  a/.,  1985;  Jen  et  al,  1986;  Kuan  et  al, 
1987;  Ihm  etal,  1987;  Klem  etal,  1987;  Huang  etal,  1988;  Gomyo  etal, 
1987  and  1988;  Norman  etal,  1987;  Shahid  etal,  1987).  Wei  and  Zunger 
(1989)  have  given  a  rather  complete  list  of  these  ordered  alloys,  most  of  which 
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are  of  the  form  ABC,  existing  in  three  different  types  of  structures,  CuAuI, 
chalcopyrite,  and  CuPt;  all  have  their  atomic  planec  stacked  as  ACBCACBC, 
but  along  three  different  directions,  (100),  (201),  and  (111),  resp)ectively,  as 
shown  in  Fig.  4.  There  have  been  several  first-principles  calculations  made  to 
study  the  structural  properties  of  these  compounds;  these  focused  mainly  on 
the  cohesive  energies  and  bond  lengths.  These  results  have  been  compared 
with  TB  calculations  by  Yeh  et  al.  (1990).  As  indicated  in  the  preceding 
section,  one  virtue  to  fitting  the  TB  model  for  the  bulk  semiconductors  is  to 
use  it  for  interpolating  alloy  properties.  Table  XII  lists  the  results  for  the  bulk 
moduli  of  a  number  of  III-V  and  II-VI  alloys  derived  from  full  TB  band 
structure  calculations  using  the  parameters  given  in  Table  VII.  Also  listed  are 
the  average  values  and  percentage  deviations  from  the  mean  AB/E.  Note  that 
all  AB  values  are  negative  and  that  most  of  the  magnitudes  are  small,  except 
for  GajAsSb  and  Ga^PSb;  the  latter  has  the  largest  difference  in  the 
constituent  compounds  among  the  alloys  listed.  Although  the  magnitudes  of 
AB  get  larger  for  systems  with  larger  differences  in  the  bond  lengths,  the 
dependence  does  not  seem  to  be  a  simple  function  of  the  bond  length 
difference.  The  uniformly  negative  AB  values  also  appeared  in  the  first- 
principles  local-density  functional  calculations  for  ordered  GaAsSb  alloys  by 
Ferreira  et  al.  (1989),  as  shown  in  Table  XIII. 

TABLE  XII 

Ordered  alloys;  TB  elastic  constant  calculations* 


^B/B  X  100 


Alloy 

^Ch 

Be. 

ZBcp 

E 

Ch 

Ca 

Cp 

AlGaAs 

7.695 

7.693 

7.689 

7.009 

-0.18 

-0.20 

-0.25 

AlGaP 

8.858 

8.858 

8.854 

8.872 

-0.15 

-0.15 

-0.20 

GalnSb 

5.226 

5.202 

J.156 

5.312 

-1.61 

-2.07 

-2.92 

AlInAs 

6.705 

6.691 

6.661 

6  761 

-0.83 

-1.03 

-1.48 

InGaAS 

6.610 

6.579 

6.508 

6.742 

-1.96 

-2.42 

-3.47 

InAlP 

7.876 

7.860 

7.774 

7.924 

-0.61 

-0.08 

-1.88 

GalnP 

8.007 

8.035 

8.878 

8.195 

-1.44 

-1.95 

-3.87 

GaAsP 

8.328 

8.291 

8.294 

8.417 

-1.05 

-1.50 

-1.46 

GaAsSb 

6.314 

6.198 

6.157 

6.741 

-6.34 

-8.05 

-8.66 

GaPSb 

6.584 

6.297 

6.188 

7.468 

-11.84 

-15.68 

-17.14 

HgCdTe 

4.470 

4.472 

4.471 

4.485 

-0.33 

-0.28 

-0.31 

HgZnTe 

4.890 

4.887 

^.632 

4.925 

-0.71 

-  0.76 

-5.93 

CdZnTe 

4.611 

4.604 

4.338 

4.650 

-0.85 

-1.00 

-6.72 

'Bulk 

moduli  (in  10" 

dynes/cm^)  of  order. 

;j  ^!:.ys 

calculated 

using  the  full  TB 

band- 

structure  method  described  in  Section  10.  The  three  structures  are  chachopyrite  (Ch),  CuAuI 
(Ca).  and  CuPt(Cp)  types.  E  is  the  average  value  of  the  constituent  compounds,  and 
AB  =  B-  B. 
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The  reason  for  the  negative  values  of  AB,  in  a  very  qualitative  argument,  is 
that  the  bulk  moduli  of  semiconductors  scale  inversely  as  high  powers  of  the 
lattice  constant,  and,  at  the  same  time,  the  alloy  lattice  constant  is  approxi¬ 
mated  well  by  the  mean  value,  by  Vegard’s  law  (1921).  This  implies  that  the 
value  of  B  at  the  mean  lattice  constant  should  lie  below  the  straight-line 
average.  Since  the  TB  results  for  the  bulk  moduli  constants  should  not  be 
qualitatively  different  from  the  valence  force-field  model  (VFF)  predictions, 
most  of  the  key  physics  for  the  bowing  of  B  should  be  contained  in  a  VFF 
analysis.  The  major  effects  in  the  VFF  can  in  turn  be  realized  from  the 
following  simple  analysis. 

Consider  the  local  structure  of  a  CuAuI  or  chalcopyrite  crystal  ABC2. 
Focus  on  a  local  tetrahedral  cluster  AjBjC  with  the  two  A  atoms  and  two  B 
atoms  on  the  vertices  of  the  tetrahedron  and  the  C  atom  near  the  center.  Let 
the  coordinates  of  the  two  A  atoms  be  (  —  1,  1,  —  \)d/y/3  and  (  —  1, 1,  —  \)d/ 
y/3  and  the  two  B  atoms  be  at  (1, 1,  \)d/y/3  and  (1,  —  1,  —  \)d/^.  Let  the 
force  constants  be  and  k^,  and  equilibrium  bond  lengths  be  for 

the  AC  and  BC  bonds,  respectively.  To  attain  equilibrium,  the  central  C  atom 

is  displaced  by  (e,  0, 0)d/y^.  We  further  define  mean  values  d-{d^  +  d^)l2 
and  k  =  {k^  +  k^)/!,  relative  differences  Sq  =  (dj^  -  d^)/d  and  Aq  =  (^a  ~ 
k^yk,  and  d  =  d{l  ■+ <5).  Then  the  AC  bond  is  stretched  by  an  amount 
d(S  +  e/3  -  <5o/2)  from  its  equilibrium  value,  and  similarly,  the  BC  bond  is 
compressed  by  did  —  e/3  +  Sq/I).  The  strain  energy  for  any  arbitrary  6  and  e 
is  given  by  A£  =  d^^k^id  +  e/3  —  do/2f  +  kf^id  —  e/3  -I-  5o/2)^].  When  A£  is 
minimized  with  respect  to  d  and  e,  one  finds  5  =  0  and  e  =  35o/2,  and  the 
minimum  A£  is  zero.  If  the  crystal  expands  uniformly  with  d  having  a  fixed 
small  value,  then  e  becomes  e  =  35o/2  —  3Ao5  and  A£  =  2k(l  —  Ao/4)d^5^. 
Thus,  the  effective  spring  constant  is 

k,„  =  k(l  -A2/4),  (123) 

which  is  smaller  than  the  average  value  k.  This  weakening  of  the  restoring 
force  constant  in  the  alloy  is  due  to  the  internal  displacement,  represented  by 
£  in  the  preceding  model,  which  provides  an  extra  degree  of  freedom  for 
relaxation  in  response  to  the  external  stress.  The  bulk  modulus  B  is 
proportional  to  k^ff/d,  so  the  alloy  bulk  modulus  minus  the  mean  B  is  then 
given  by 


AB  =  B(5oAo  -dl-  Ag/4) ,  (124) 


where  we  recall  the  definitions  do  =  (di  —  djVd  and  Aq  =  (kj  —  ^2)/^-  Since 
and  do  and  Aq  tend  to  have  different  signs,  the  bond  length  difference  gives  an 


209 


1 .  ELASTIC  CONSTANTS  AND  THEIR  ALLOYS 


55 


extra  negative  contribution  to  AB  (the  first  two  terms).  If  both  the  bond¬ 
stretching  force  constant  a  and  the  bond-angle  restoring  force  ^  in  the  VFF 
are  included,  the  equilibrium  value  of  AE  is  no  longer  zero,  but  the  deviation 
AB  can  also  be  shown  to  be  similar  to  Eq.  (123)  and  is  given  by 


AB  = 


(Ag)^  1 1 

(g  +  2/5)(3g  -h  P)]l 


4, 


(125) 


where  Ag  =  (gj  —  gj)  and  g  =  (gj  4-  a2)/2  and  similarly  for  Ap  and  p. 
Equation  (125)  reduces  to  (124),  if  P  is  set  equal  to  zero. 

The  above  descriptions  illustrate  two  mechanisms  for  the  negative  AB 
values,  the  l/o'*  scaling  of  B,  with  q  ranging  from  3.5  to  9,  and  more  degrees  of 
freedom  for  internal  relaxation.  Quantitative  results  should  be  described  by 
TB,  because  in  addition  to  the  strain  energy,  there  is  also  some  chemical  effect 
built  into  the  TB  theory.  Although  these  ordered  compounds  have  been 
found  from  epitaxial  growth,  the  bulk  moduli  are  probably  difficult  to 
measure,  because  these  alloys  are  not  single  bulk  crystals  and  because  the 
ordering  is  only  partial.  It  is  interesting  to  note  that  the  B  values  for  a  SiC/ 
AIN  alternating  layer  superlattice  along  (100)  and  for  the  constituent 
compounds  have  been  calculated  by  Lambrecht  and  Segall  (1990)  using  the 
LMTO;  their  percentage  deviation  from  the  mean  AB/B  was  found  to  be 
about  —2%,  which  falls  in  the  range  of  the  ternary  alloys  in  Tables  XII  and 
XIII. 

Not  all  the  mechanisms  considered  above  apply  to  the  ordered  compounds 
of  the  elemental  semiconductors,  because  internal  relaxation  under  pressure 
may  not  be  allowed,  e.g.,  if  the  structure  is  assumed  to  be  zincblende. 
Unfortunately,  a  simple  analysis  of  the  elastic  constants  of  the  4-4  com¬ 
pounds  cannot  yet  be  made,  because  the  tight-binding  and  VFF  parameters 
have  not  yet  been  extended  to  deal  with  the  atomic  pairs  not  existing  in  the 


TABLE  XIII 

Ordered  alloys;  bulk  moduli* 


GaAs 

GaSb 

Ga^AsSb 

Ga4As3Sb 

Ga4AsSb3 

Structure 

zb 

zb 

CA 

CH 

CP 

LU 

FA 

LU 

FA 

B 

t^BjE  X  100 

lAt 

5.18 

6.10 

-3.5 

5.92 

-6.3 

5.96 

-5.7 

6.52 

-5.4 

0.58 

-4.5 

5.40 

-6.1 

5.31 

-7.7 

'Calculated  bulk  moduli  for  GaAs,  GaSb,  and  GaAsSb  ordered  alloys  by  Ferreira  et  al. 
(1989),  and  the  corresponding  percentage  deviation  from  the  concentration-weighted  average. 
The  structures  are  zincblende  (zb),  CuAl  (CA),  chachopyrite  (CH),  CuPt  (CP),  luzonite  (LU), 
and  famatinite  (FA). 
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constituent  crystals.  However,  several  first-principles  calculations  have  been 
made  on  the  ordered  SiC  and  SiGe  (Martins  and  Zunger,  1986;  Qteish  and 
Resta,  1988;  van  Schilfgaarde,  1990).  The  main  results  are  listed  in  Table 
XIII;  the  theoretical  results  were  calculated  for  the  zincblende  structure.  The 
plane-wave  pseudopotential  (PP-PW)  calculation  of  Martins  and  Zunger 
(1986)  for  SiC  gave  a  —  21%  value  for  AB/B,  which  is  in  reasonable  agreement 
with  the  experimental  value  of  — 17%.  This  is  consistent  with  the  qualitative 
argument  based  on  the  I/d'*  (q  >  3.5)  scaling  of  B.  The  theory  also  yielded  a 
negative  formation  energy,  which  also  agrees  with  experiment.  For  SiGe,  the 
theoretical  calculations  cited  in  Table  XIV  gave  positive  formation  energies, 
which  are  consistent  with  the  fact  that  no  ordered  bulk  compounds  of  SiGe 
have  been  grown.  However,  some  weak  ordering  has  been  found  in  the 
epitaxial  SiGe  films  (Ourmazd  and  Bean,  1985).  The  calculated  values  of  AB/ 
B  for  the  zincblende  SiGe  are  either  slightly  above  or  just  below  zero.  These 
differences,  however,  fall  within  the  uncertainties  of  the  present  first-princi¬ 
ples  theory.  The  best  conclusion  that  can  be  drawn  from  these  results  is  that 
the  B  value  for  SiGe  should  be  very  close  to  the  mean  value. 


b.  Disordered  Alloys 


Disordered  binary  alloys  Aj  .^^B,  of  diamond  semiconductors  and  pseudo¬ 
binary  alloys  Ai_,B,C  of  zincblende  semiconductors  AC  and  BC  are 
considered  in  this  section;  they  are  not  amorphous  materials,  as  they  still 


TABLE  XIV 


Bulk  moduli  of  ordered  binary  alloys  AB  of  the  diamond  semiconductors  A  and  B 

From  Theories  and  Experiment 


SiC 

PP-PW' 

Experiment 

B(C) 

50.3 

44.23 

B(Si) 

9.53 

9.92 

B(SiC) 

23.4 

22.4 

AB/S(%) 

-21 

-17 

SiGe 

PP-PW' 

pp.pw‘ 

ASA' 

FP-LMTO' 

B(Si) 

9.53 

9.8 

8.80 

9.58 

B(Ge) 

7.75 

7.7 

6.25 

7.05 

B(SiGe) 

8.73 

8.7 

7.38 

8.31 

AB/S(%) 

1 

0 

2 

0 

'Martins  and  Zunger  (1986). 
‘Qteish  and  Resta  (1988). 
'van  Schilfgaarde  (1990). 
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possess  their  constituent  diamond  and  zincblende  lattices,  respectively,  as 
characterized  by  their  crystal  diffraction  patterns.  We  shall  consider  the 
pseudobinaries  first.  The  alloying  atoms  A  and  B  in  these  alloys  belong  to  a 
fee  sublattice,  and  the  C  atoms  to  the  other  sublattice.  However,  the  positions 
of  the  A  and  B  atoms  are  not  necessarily  locked  precisely  on  the  lattice  sites. 
The  extended  x-ray  absorption  fine-structure  spectroscopy  (EXAFS)  data 
(Mikkelsen  and  Boyce,  1982  and  1983;  Boyce  and  Mikkelsen,  1985;  Balzar- 
otti  et  al,  1985)  have  consistently  shown  a  bimodal  distribution  of  the  bond 
lengths  in  these  alloys,  although  the  average  lattice  constant  follows  the 
Vegard  (1921)  law  a  =  (1  —  x)aAc  +  ^^bc-  Figure  5  shows  an  example  of  the 
results  for  the  bond  lengths  in  Gaj  .^^In^^s  deduced  by  Mikkelsen  and  Boyce 


Fig.  5.  Near-neighbor  bond  lengths  (GaAs  and  InAs  in  the  Ga,  _  alloy)  as  a  function 
of  composition  x,  measured  by  EXAFS  (Mikkelsen  and  Boyce,  1982). 


212 


58 


A  -B  CHEN.  ARDEN  SHER  AND  W  T  YOST 


(1982)  from  their  EXAFS  data.  In  a  first  approximation,  the  crystal  structure 
of  an  alloy  can  be  viewed  as  having  the  A  and  B  atoms  on  their  fee  sublattice 
with  an  average  lattice  constant,  while  the  C  atoms  are  distorted  away  from 
their  lattice  sites,  in  a  way  similar  to  the  local  structures  of  the  three  ordered 
superlattices  considered  earlier.  The  difference  is  that  there  is  no  long-range 
superlattice  ordering  of  the  A  and  B  atoms  in  the  disordered  state.  This 
simple  crystal  picture  is  only  a  first  approximation;  the  EXAFS  experiments 
just  cited  also  indicate  that  the  sublattice  of  the  A  and  B  atoms  is  less  than  a 
perfect  fee.  There  are  also  theoretical  calculations  (e.g.,  Sher  et  al,  1987;  Wei 
et  al.,  1990)  that  suggest  a  certain  degree  of  short-range  ordering  in  these 
alloys,  namely  that  the  arrangement  of  the  A  and  B  atoms  is  not  random. 
Figure  6  shows  an  example  of  the  calculated  deviations  (Sher  et  al.,  1987)  of 
the  probabilities  from  random  distribution,  Ap„  =  p„  —  for  Gaj  ^^In^^As  as 
a  function  of  alloy  concentration  x,  where  p„  is  the  probability  of  having  n  Ga 
atoms  and  4-n  In  atoms  on  the  vertices  of  a  local  tetrahedral  cluster  in  the 
alloy,  and  =  4C„(1  —  x)"x^‘",  where  ^C„  is  a  binomial  coefficient,  is  the 
corresponding  value  for  the  random  distribution. 

The  structural  energy  needed  for  calculating  the  elastic  constants  of  a 
disordered  alloy  is  an  ensemble  average  of  the  total  energy  over  the 
distribution  of  the  alloying  atoms  under  strains.  It  has  been  demonstrated 
(Ferreira  et  al.,  1989)  that  the  total  energy  of  a  semiconductor  can  be 
decomposed  into  the  sum  of  multisite  correlation  energies,  from  the  single¬ 
site,  the  pair,  and  up  to  a  cluster  containing  a  handful  of  sites.  In  other  words, 
the  multisite  correlation  energies  converge  to  zero  quickly  at  a  manageable 
number  of  sites.  This  implies  that  the  structural  energy  of  an  alloy  is  an 
average  of  these  multisite  correlation  energies.  Connolly  and  Williams  (1983), 
working  on  metal  alloys,  proposed  that  these  multisite  correlation  energies  be 
deduced  from  the  ordered  systems  that  are  composed  of  the  same  atoms.  This 
scheme  allows  a  direct  application  of  the  first-principles  theory  in  the 
calculation  of  the  energy  parameters.  These  energetics  can  then  be  used  in  the 
alloy  statistics  such  as  in  the  cluster  variational  method  of  Kikuchi  (1951)  or 
in  the  Monte  Carlo  calculations  to  deduce  the  distribution  functions  or  the 
average  properties.  This  theory  has  been  carried  out  extensively  for  semicon¬ 
ductor  alloys  by  Ferreira  et  al.  (1989),  and  respectable  results  have  been 
obtained  for  the  phase  diagrams  and  alloy  equilibrium  properties.  For  this 
theory  to  fit  the  elastic  constants  requires  detailed  dependences  of  the 
multisite  energies  under  different  strains  that  have  yet  to  be  worked  out.  Also, 
the  validity  of  using  the  energy  parameters  deduced  from  ordered  alloys  in 
the  disordered  systems  needs  to  be  examined  further. 

There  is  a  different  cluster  approach,  which  directly  relates  the  alloy 
Hamiltonian  to  the  distribution  function  (Gautier  etal,  1975;  Chen  etal, 
1987;  Berera  et  ai,  1988;  Dreysse  etaU  1989).  In  this  approach  one  focuses 
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Fig.  6.  Cluster  populations  relative  to  those  in  a  random  alloy  Xj  -  Xj  for  clusters  with 
=  0, 1, 2, 3, 4  B  atoms  for  a  Ga,  .^In^^As  alloy  equilibrated  at  600  K. 

a  particular  cluster  in  an  alloy  ensemble.  The  average  energy  per  cluster  can 
be  written  (Chen  et  al.,  1988)  as 

<e>  =  Z  I  ^  •  (126) 

on  where  £„  is  the  energy  of  a  cluster  detached  from  a  given  alloy  configura¬ 
tion,  and  h„„  is  the  interaction  energy,  which  is  the  change  of  energy  of  the 
combined  system  when  the  cluster  is  put  back  into  the  alloy.  In  Eq.  (126),  p„  is 
the  probability  that  a  cluster  is  of  the  type  «,  specified  by  the  number  of  A  and 
B  atoms  and  their  arrangements,  and  is  a  conditional  probability  that  the 
surrounding  environment  is  in  state  m  when  the  cluster  is  in  state  n.  The 
factor  1/2  in  Eq.  (126)  is  to  eliminate  the  double  counting  in  the  total  average 
alloy  energy  <£>  =  M<e>,  where  M  is  the  ratio  between  the  size  of  the  alloy 
and  the  cluster.  If  one  writes  <£>  =  S  p„£(«),  then  an  effective  cluster  energy 
can  be  defined  as 


(127) 
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This  procedure  is  particularly  useful  when  the  interaction  energy  is 
short-ranged.  One  can  start  with  a  small  cluster  and  a  given  probability 
distribution,  then  calculate  the  effective  cluster  energies  from  Eq.  (126).  These 
energies  are  then  used  in  a  statistical  theory  to  deduce  a  cluster  distribution, 
which  in  turn  is  used  to  calculate  a  new  set  of  £(n)  and  distribution  functions, 
and  the  process  is  iterated  until  it  converges.  It  should  be  pointed  out  that  to 
compute  the  total  energy  of  an  alloy  quantum  mechanically,  one  needs  to 
solve  the  Schroedinger  equation  for  a  Hamiltonian  that  does  not  have  the 
lattice  translational  symmetry  so  indispensable  in  traditional  band-structure 
theory.  If  the  fluctuation  of  the  alloy  potential  from  the  virtual  crystal 
approximation  (VCA),  where  the  alloy  potential  is  approximated  as  the 
concentration-weighted  average,  is  small,  then  the  next  leading  correction  to 
VCA  can  be  obtained  from  perturbation  theory.  This  should  work  for  most 
semiconductor  alloys  except  for  systems  with  large  potential  fluctuations 
such  as  Hg,_,Cd,Te  (Chen  and  Sher,  1982;  Spicer  era/.,  1982;  Hass  etal., 
1983).  A  more  general  but  more  difficult  approach  is  to  extend  the  present 
molecular  coherent  potential  approximation  (MCPA)  (Hass  et  ai,  1983)  to 
clusters  and  to  achieve  a  triple  self-consistency  (Chen  et  ai,  1987);  consisten¬ 
cy  between  cluster  distribution  and  Hamiltonian,  between  the  Hamiltonian 
and  electron  density,  and  between  the  self-energy  operator  I  in  the  cluster 
CPA  theory  and  the  potential  fluctuations.  To  date  this  theory  has  been 
carried  out  only  for  metal  alloys,  and  then  only  within  the  single-site  KKR- 
CPA  with  a  random  distribution  (Schwartz  and  Bansil,  1975;  Gyorffy  and 
Stocks,  1978).  Major  work  is  needed  to  determine  if  this  approach  can 
achieve  the  same  degree  of  rigor  for  disordered  alloys  as  self-consistent 
density  functional  theory,  which  has  been  successfully  used  in  dealing  with 
crystalline  semiconductors. 

The  above  idea  has  been  applied  to  an  elastic  medium  model  to  deduce  a 
mean  field  theory  for  the  internal  strain  and  bulk  modulus  in  semiconductor 
alloys  (Chen  et  ai,  1988).  This  theory  starts  by  assuming  that  the  alloy  has  an 
effective  lattice  constant  and  effective  modulus.  When  part  of  the  effective 
alloy  medium  is  replaced  by  a  specified  cluster,  there  will  be  strain  energy 
introduced.  It  was  shown  that  this  strain  energy  can  be  taken  as  the  effective 
energy  £(n)  for  that  cluster,  the  probability  distribution  p„  within  a  statistical 
theory  can  then  be  deduced.  The  internal  strain  energy  is  calculated  as 
E  =  M<£>  —  ML  p„£in).  When  the  alloy  is  under  an  external  pressure  SP, 
the  effective  cluster  energy  will  change  by  an  amount  Se{n),  which  implies  a 
change  of  the  total  energy  by  an  amount  SE  =  M(,de(n)).  Then  the  bulk 
modulus  of  the  alloy  can  be  obtained  from  A£  =  ^dP)^VIB,  where  V  is  the 
alloy  volume.  The  mean-field  nature  of  this  approach  is  evident  from  the  fact 
that  the  calculation  requires  knowledge  of  the  alloy  lattice  constant  and 
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elastic  constants  that  are  only  assumed  and  are  required  to  be  calculated  self- 
consistently.  To  illustrate  this  self-consistency  procedure,  let  us  consider  the 
following  simple  spring  model  for  a  random  pseudobinary  alloy 
The  cluster  corresponds  to  the  four  bonds  surrounding  an  “impurity”  atom 
A  or  B,  and  the  environment  of  the  cluster  corresponds  to  the  12  bonds  that 
connect  inwardly  to  the  cluster  and  outwardly  to  a  rigid  lattice  of  the  effective 
alloy.  It  is  worth  mentioning  that  there  have  been  detailed  analyses  of  the 
valence-field  force  models  for  the  strain  energies  of  semiconductor  alloys  in 
regard  to  the  range  allowed  for  lattice  relaxation  (Martins  and  Zunger,  1984; 
Chen  and  Sher,  1985).  It  was  found  that  by  neglecting  the  bond-angle  forces, 
one  can  use  a  shorter  range  of  lattice  relaxation  to  obtain  the  correct  mixing 
enthalpies  and  bond  lengths.  The  simple  model  considered  here  works 
amazingly  well  for  the.';e  properties. 

Let  the  spring  constants  for  the  pure  AC  and  BC  compounds  be  and  k^, 
respectively,  and  the  effective  alloy  spring  constant  be  k,  with  similar 
notations  for  the  bond  lengths  d^,  and  d.  When  an  A  atom  is  embedded  in 
the  medium,  all  16  bonds  under  consideration  will  relax,  and  the  strain 
energy  is  given  by 


=  (128) 

where 

k,=4kM^kA  +  k).  (129) 

A  similar  energy  e(B)  is  obtained,  when  a  B  atom  is  embedded.  The  effective 
bond  length  d  is  obtained  from  a  minimization  of  the  average  cluster  energy 
£  =  (1  —  x)£(A)  -I-  x£(B)  with  respect  to  d,  which  yields 

d  =  -  x)kid^  +  x/cj^bD/KI  -  x)ki  -I-  xkj]  •  (130) 

When  the  alloy  is  compressed,  the  alloy  bond  length  is  reduced  to  d(l  —  e), 
where  ^  is  a  macroscopic  strain  corresponding  to  the  external  pressure.  The 
pressure-induced  strain  energy  for  the  16  bonds  in  the  medium  is  SE  — 
%k(de)^,  and  SE  =  Ik^de)^  for  each  cluster.  Embedding  an  A  atom  in  this 
compressed  medium,  one  finds  the  total  strain  energy  for  the  16  bonds  to  be 

E^  =  ]^k,{d-dj,~Adef.  (131) 
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To  obtain  the  extra  cluster  energy  <5£(A)  induced  by  the  pressure,  we  subtract 
£(A)  of  Eq.  (127)  and  the  background  energy  for  the  surrounding  12  bonds 
from  to  give 

^£(A)  =  -4ki(d  -  d^)de  +  Sk^idey  -  ekidef .  (132) 

Similarly,  the  following  expression  for  ^£(8)  is  obtained  when  the  embedded 
atom  is  B: 


<5£(B)  =  —4k2(d  —  d^)de  +  ^k^idey  —  6k(dey .  (133) 

Thus,  the  change  of  the  average  cluster  energy  that  is  due  to  the  pressure  is 
given  by  3E  =  <^Se(n)y  =  (1  —  x)^£(A)  +  x5£(B),  which,  when  equated  to 
Ikidey,  leads  to  the  following  self-consistent  equation  for  the  effective  spring 
constant  k  :k  =  (\  —  x)/ci  +  xk  2-  The  k  can  now  be  solved  analytically  when 
both  the  expression  for  ki  in  Eq.  (129)  and  the  similar  expression  for  ^2  are 
used.  The  result  is 


k  =  </c>[l  -  3x(l  -  xmKkyy} ,  (134) 

where  </c>  =  (1  —  x)k^  +  x/cg  is  the  mean  spring  constant  and  dk  =  k^  —  k^ 
the  difference.  It  is  interesting  to  compare  this  result  for  the  50/50  alloy,  i.e., 
k  =  k{l  —  jA^)  with  the  value  k(l  —  Al/4)  in  Eq.  (123)  for  the  ordered  alloys 
in  the  CuAuI  and  chalcopyrite  structures.  The  alloy  spring  constant  is  slightly 
below  the  straight-line  average,  and  the  bowing  is  larger  for  a  disordered 
alloy  than  for  the  corresponding  ordered  compound.  Using  the  effective 
spring  constant  of  Eq.  (134),  we  find  that  the  effective  bond  lengths  for  most 
alloys  also  bow  slightly  below  their  mean  value, 

d=idy+  4x(l  -  x){d^  -  deX^A  -  k^)k/iOk^  +  k)i3k^  +  k)]  (135) 

because  the  spring  constant  tends  to  increase  as  the  bond  length  decreases. 

To  compare  the  calculations  above  with  experimental  data  for  pseudobin¬ 
ary  alloys,  we  were  able  to  find  results  for  GaAlAs  (Landolt-Bomstein,  1988), 
CdZnTe,  CdMnTe,  and  HgCdTe  (Quadri  etai,  1986).  For  GaAlAs,  the 
following  linear  x  dependences  were  measured  (Landolt-Bomstein,  1988): 
C,i  =  1 1.85  -H  0.14x,  C,2  =  5.38  +  0.32x,  and  Q*  =  5.94  -  0.05x.  This  lack 
of  detectable  bowing  is  expected,  because  of  the  nearly  equal  bond  lengths  of 
the  two  constituent  compounds  and  small  differences  in  the  elastic  constants. 
The  bulk  moduli  in  the  three  II-VI  alloy  systems  mentioned  were  obtained 
from  high-pressure  x-ray  diffraction  data.  For  HgCdTe,  the  results  are  similar 
to  those  for  the  GaAlAs  in  that  both  the  bond  lengths  and  the  bulk  moduli  of 
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TABLE  XV 

Measured  elastic  constants  in  lo"  dynes/cm^  of  SiGc  alloys  by  Bublik  £tal  (1974) 


Alloy 

Cl  1 

C.2 

C44 

Sio. 28^^0.72 

16.1  ±  0.8 

8.35  ±  0.8 

8.55  ±  0.4 

Sio.54Geo,46 

17.0  ±  0.8 

Sio,46^So  36 

17.1  ±  0.8 

HgTe  and  CdTe  are  so  close  that  the  differences  in  B  between  the  alloys  and 
the  pure  crystals  were  beyond  the  experimental  resolution.  However,  a  5%  Zn 
in  CdZnTe  alloy  was  found  to  give  a  15%  increase  in  the  B  value  from  the 
pure  CdTe  value  and  a  10%  Mn  in  CdMnTe  gave  a  21%  decrease  (Quadri 
et  al.,  1986).  These  significantly  large  changes  in  the  B  values  caused  by 
smaller  concentrations  cannot  be  explained  from  the  above  considerations. 

The  qualitative  model  considered  above  does  not  apply  to  the  binary  alloys 
Aj  because  in  these  alloys  both  the  A  and  B  atoms  can  be  found  in  both 
sublattices,  and  the  local  bond  length  arrangement  is  more  complicated  than 
the  pseudobinary  alloys.  However,  one  can  expect  that  there  are  still  more 
degrees  of  relaxation  in  the  disordered  binaries  than  in  the  ordered  com¬ 
pounds.  Therefore,  one  would  conclude  that  the  bulk  modulus  of  the 
disordered  50-50  SiGe  alloy  would  have  a  smaller  value  than  those  tabulated 
in  Table  XIII  for  the  ordered  compounds.  At  least  one  would  not  expect  the 
alloy  B  values  to  be  significantly  larger  than  the  mean  values  B.  However,  the 
only  experimental  data  available  (Bublik  et  ai,  1974),  Table  XV,  show  that 
all  three  elastic  constants  for  these  alloys  at  three  different  concentrations 
exceed  the  values  for  Si,  and  that  AB/B  is  as  large  as  20%,  despite  the  fact  that 
the  bond  length  difference  between  Si  and  Ge  is  only  about  3%  and  the 
measured  alloy  lattice  constants  are  only  bowed  slightly  below  the  average. 
This,  and  the  unexplained  results  for  the  II-VI  alloys,  point  to  the  need  for  a 
more  systematic  study  of  the  elastic  properties  of  semiconductor  alloys,  both 
experimentally  and  theoretically. 


IV.  Dislocations  and  Hardness” 

Hardness  has  proven  to  be  a  useful  probe  of  the  mechanical  properties  of 
the  brittle  semiconductors.  Here  we  will  use  the  term  hardness  to  refer 
specifically  to  Vickers’  hardness,  unless  otherwise  noted.  In  the  Vickers’ 

®Much  of  this  section  is  adapted  from  “Final  Report”  (AFOSR-F49620-85-C)023)  by  M.  A. 
Berding  (1988),  SRI  International,  Menlo  Park,  California. 
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hardness  measurement,  a  square  pyramidal  indenter  is  used,  and  the  hardness 
number  is  given  by  the  applied  load  divided  by  the  area  of  the  indentation 
(i.e.,  units  of  pressure).  Hardness  has  been  found  to  be  an  intrinsic  property  of 
the  material,  because  it  is  relatively  independent  of  the  applied  load.  One 
advantage  of  hardness  measurements  for  semiconductors  is  that,  in  contrast 
to  bending  tests,  only  small  samples  are  necessary  for  conventional  Vickers’ 
hardness  measurements,  or  for  nanoindenter  measurements  (Fang  et  ai, 
1990).  and  relatively  thin  epitaxial  films  can  be  probed.  Additionally,  unlike 
conventional  tests  used  to  measure  yield  stress,  hardness  measurements  can 
be  made  at  room  temperature,  which  is  far  below  the  usual  plastic  regime  for 
most  semiconductors.  As  such,  the  hardness  measurement  provides  a  conve¬ 
nient  and  usable  probe. 

The  question  remains,  though,  as  to  the  interpretation  of  the  hardness 
measurement  in  semiconductors:  Just  what  property  or  properties  of  a 
semiconductor  are  we  measuring  when  we  measure  hardness?  In  metals,  an 
empirical  relationship  is  found  between  the  hardness  H  and  yield  stress  Y, 
such  that  H  =  3Y. 

In  metals,  this  relationship  can  be  justified  on  the  basis  of  continuum 
theory,  as  discussed  in  McClintock  and  Argon  (1966).  In  semiconductors 
such  a  simple  relationship  between  H  and  Y  is  not  necessarily  appropriate  for 
several  reasons.  During  deformation  in  metals,  many  slip  planes  can  be  active 
because  the  Peierls  barriers  for  dislocation  motion  in  most  directions  are  low. 
In  contrast,  because  the  bonds  in  semiconductors  are  strongly  covalent,  the 
Peierls  barriers  are  high  and  dislocation  in  the  (111)1/2<T10>  slip  system 
dominate. 

To  date,  there  is  no  complete  quantitative  theory  of  hardness  in  the 
semiconductors  in  which  the  temperature  dependence,  photoplastic  effect, 
and  the  alloy  hardening  effect  are  included.  Sher  et  al.  (1985)  proposed  a 
model  of  hardness  for  the  semiconductor  compound  that  gives  good  quanti¬ 
tative  agreement  with  experiment,  but  this  model  does  not  provide  an 
explanation  for  several  of  the  observed  dependences  of  hardness.  This  model 
of  hardness  in  semiconductors  differs  from  more  conventional  interpretations 
and  suggests  that  hardness  is  dominated  by  dislocation-dislocation  interac¬ 
tions,  as  opposed  to  dislocation  activation  and  motion  terms.  We  discuss  the 
results  of  an  improved  quantitative  model  of  hardness  below. 


12.  Sup  Systems 

In  the  dislocation  interaction  hardness  model,  Vickers’  hardness  is  found 
to  be  dominated  by  the  interaction  energy  of  an  idealized  array  of  dislo¬ 
cations  that  has  been  generated  by  the  indenter.  The  idealized  array 
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can  be  considered  as  a  first  approximation  to  the  more  realistic  dislocation 
tangles  found  experimentally,  leading  in  higher  order  to  an  expansion  in 
dislocation  configurations.  In  this  idealized  array,  no  account  is  taken  of  the 
true  slip  systems  active  in  the  semiconductors.  Experiments  (Hirsch  et  al, 
1985)  have  demonstrated  that,  in  Vickers’  hardness,  slip  occurs  primarily  on 
the  {111}t<T10>  glide  set,  where  the  threefold  symmetry  of  slip  and  rosette 
lines  occurs  at  the  intersection  of  the  (111}  planes  with  the  (111)  surface. 

For  indentation  on  the  (111)  plane,  dislocations  can  glide  on  the  (111) 
plane  parallel  to  the  surface  or  on  one  of  the  three  other  (111)  planes  with  a 
total  of  four  active  slip  planes.  Although  the  detailed  analysis  differs  from  that 
given  previously  (Sher  et  al,  1985),  the  contribution  to  the  hardness  from  the 
interaction  energy  is  comparable  to  that  previously  calculated.  This  contribu¬ 
tion  to  H  is  directly  proportional  to  the  shear  coefficient. 


13.  Peierls  Energy 

The  Peierls  energy  is  difficult  to  calculate  precisely  because  of  dislocation 
charge  effects  and  reconstruction  at  the  dislocation  core.  In  the  context  of  the 
hardness  measurement  we  calculate  the  Peierls  energy  in  order  to  evaluate 
the  importance  of  this  contribution  to  the  Vickers’  hardness  number. 
Although  it  is  generally  agreed  that  dislocations  in  semiconductors  move 
through  the  generation  and  propagation  of  double  kinks,  in  the  hardness 
measurement,  the  region  about  the  indenter  is  grossly  plastically  deformed. 
Because  the  dislocation  velocity  is  low  at  room  temperature  (see  below),  the 
large  dislocation  pile-up  model  proposed  by  Sher  et  al  (1985)  may  be 
appropriate.  If  the  dislocation  separation  is  small,  dislocation  motion 
through  kink  processes  will  be  suppressed  and  the  dislocations  will  propagate 
as  a  complete  unit. 

To  get  from  Configuration  A  to  Configuration  B  in  Fig.  7,  we  must  break  a 
row  of  bonds.  Since  the  long-range  strain  fields  should  be  comparable  in  the 
two  configurations  as  well  as  in  intermediate  configurations,  the  Peierls  force 
can  be  calculated  from  local  energy  considerations  only.  The  energy  to  break 
a  bond  at  the  dislocation  core  is  approximately  given  by 

V„  =  2^vl  +vl  +  2£„.,  -  +  ?(£;  -  £‘),  (136) 

where  Fj  is  the  covalent  energy,  Fj  the  ionic  energy,  £„e,  the  metallization 
energy,  F^  the  bond  overlap  energy,  n  is  1  for  III-V  and  2  for  II-VI 
compounds,  and  eH  and  the  hybrid  energy  for  the  anion  and  cation, 
respectively.  The  first  two  terms  in  Eq.  (136)  account  for  the  loss  of  the 
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PLANE 


PLANE 


Fig.  7.  Atom  configurations  during  the  slip  of  a  dislocation,  (a)  Configuration  A;  (b) 
Configuration  B. 


bonding  energy  of  the  two  electrons  in  the  breaking  bond,  the  third  term 
accounts  for  regaining  the  repulsive  interaction  energy  of  the  bond,  and  the 
fourth  term  accounts  for  the  energy  gain  to  transfer  electrons  back  from  the 
cation  to  the  anion.  We  note  that  the  electron  orbitals  of  the  atoms  at  the 
dislocation  core  are  left  in  the  sp^  hybrids  after  the  bond  breaking.  The 
expression  in  Eq.  ( 1 36)  represents  a  theoretical  maximum  of  the  Peierls 
energy,  since  no  reconstruction  at  the  core  has  been  included. 

To  calculate  the  Peierls  energy  per  unit  length,  we  consider  a  primary 
dislocation  in  the  <TlO>  direction  in  a  zincblende  compound.  The  number  of 
bonds  per  unit  length  in  <TlO>  is  given  by  1/h,  where  b  is  Burger’s  vector. 
Thus,  the  Peierls  energy  per  unit  length  is  given  by 


where  d  is  the  bond  length. 

We  can  now  calculate  the  Peierls  force,  or  the  force  per  unit  length 
necessary  to  move  a  dislocation  over  the  potential  barrier,  as  illustrated  in 
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Fig.  8.  The  Peierls  energy  is  related  to  the  Peierls  force  through 


£ 


p 


F.L, 


where 


L  = 


b 

2 


(138) 


(139) 


is  the  distance  between  Configuration  A  and  Configuration  B.  Solving  for  £p 
in  terms  of  and  d,  we  arrive  at; 


F 


p 


_3^ 


(140) 


or 


T 


P 


28  d^' 


(141) 


Values  for  Uf,  and  tp  are  summarized  in  Table  XVI. 

Now  we  incorporate  the  Peierls  energy  into  the  hardness  model  for  low 
temperature  where  the  full  barrier  must  be  surmounted.  The  Sher  model  is 
based  on  energy  considerations.  The  Vickers’  hardness  number  is  given  by 
the  applied  force  divided  by  the  area  of  indentation.  Multiplying  the 
numerator  and  denominator  by  h,  the  depth  of  indentation,  we  have 


Fig.  8.  Schematic  of  the  dislocation  potential  as  a  function  of  its  position. 
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TABLE  XVI 


Calculated  Peierls  stress  and  hardness  for  various  zincblende  semiconductors,  with 

EXPERIMENTAL  HARDNESS  VALUES  FOR  COMPARISON* 


"p 

Wp  + 

«..p 

c 

11.35 

23,300 

1940 

9244 

11,184 

10,000 

Si 

5.95 

3430 

286 

1098 

1384 

1370 

Ge 

6.67 

3440 

286 

893 

1179 

1000 

Sn 

5.66 

1930 

161 

— 

— 

— 

AlP 

6.04 

3410 

284 

— 

— 

— 

GaP 

6.15 

3500 

292 

903 

1195 

940 

InP 

5.78 

2640 

220 

548 

768 

520 

AlAs 

5.90 

3000 

250 

— 

— 

505 

GaAs 

6.03 

3000 

256 

750 

1006 

580 

InAs 

5.64 

2340 

195 

469 

664 

430 

AlSb 

5.08 

2840 

237 

524 

761 

400 

GaSb 

5.36 

2180 

182 

553 

735 

450 

InSb 

4.92 

1670 

139 

365 

504 

230 

ZnS 

5.14 

3000 

250 

515 

765 

— 

CdS 

4.77 

2750 

229 

288 

517 

— 

HgS 

5.68 

2590 

216 

— 

~ 

_ 

ZnSe 

4.92 

250C 

209 

462 

671 

137 

CdSe 

4.59 

1890 

157 

254 

411 

— 

HgSc 

4.523 

1840 

154 

232 

386 

— 

ZnTe 

4.42 

1720 

143 

374 

517 

82 

CdTe 

4.01 

1350 

113 

222 

335 

60 

HgTe 

3.99 

1360 

113 

230 

343 

25 

•The  U  values  are  in  eV,  and  the  others  in  kg/mm^. 


H  =  EliW^h),  where  E  =  Fh  is  the  energy  of  indentation,  and  h  is  the  depth 
of  the  indentation.  Including  the  interaction  energy  only  we  have 

r,  Gcot0  r  ,  fcol6\  4  .  2  ^1 

where  6  is  one  half  the  indenter  angle.  To  include  the  Peierls  energy,  we 
consider  the  total  energy  necessary  to  move  the  dislocations  from  their  initial 
to  final  positions  in  the  idealized  model.  The  hardness  is  then  given  by: 

W  =  +  (143) 
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where 


H 


p 


W^h 


(144) 


is  the  Peierls  contribution  to  the  hardness,  and  £p  is  the  total  Peierls  energy 
expended.  The  total  length  of  dislocation  to  be  moved  is  calculated  to  be 


1  n 


(145) 


The  total  Peierls  energy  is  given  by 


t/b  ,  1  t/bW'" 


^p  =  ir^  =  2 


6 


cos^  6 . 


(146) 


Thus,  we  have 


For  e  =  45% 


H, 


3 


cos  0  sin  0 . 


(147) 


H 


p 


6  b^' 


(148) 


Values  of  Hp  are  summarized  in  Table  XVL  Several  features  of  Hp  should 
be  noted.  First,  we  have  used  a  zero-temperature  value  of  the  Peierls  energy. 
Because  hardness  measurements  are  typically  done  at  room  temperature,  one 
should  take  the  thermal  energy  into  account;  this  will  reduce  the  values  of  Hp 
from  those  listed  in  Table  XVI.  Also  shown  in  Table  XVI  are  Hi„,,  +  Up, 

the  best  theoretical  estimate  for  H,  and  Note  that,  like  Hp  is 
independent  of  the  applied  load,  in  agreement  with  experiment.  Also  note 
that  Hp  improves  the  agreement  between  theory  and  experiment  for  the  hard, 
nonpolar  materials.  For  the  softer,  more  ionic  materials,  H  is  overestimated 
by  the  theory.  The  overestimation  of  H  may  be  because  of  neglect  of 
dislocation  velocity  effects  and  their  temperature  dependence,  as  discussed 
below. 
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14.  Temperature  Dependence 

Here  we  summarize  the  experimental  results  and  discuss  a  tentative  theory 
of  the  temperature  dependence  of  the  hardness. 

Several  recent  studies  on  the  temperature  dependence  of  hardness  serve  to 
illustrate  the  behavior.  Results  for  GaAs  and  Ge  are  shown  in  Fig.  9.  The 
(111)  and  ( 100)  faces  of  GaAs  have  been  examined  by  Hirsch  et  ai  ( 1 985)  and 
Guruswamy  et  a/.  (1986),  respectively.  Results  for  the  Knoop  hardness  on  the 
(100)  face  of  n-type  Ge  are  also  shown  (Roberts  et  al.,  1986).  The  (100)  face  of 
GaAs  and  the  Ge  show  a  definite  temperature  dependence  with  a  relatively 
temperature-independent  region  for  T  <  450  K  and  an  exponential  tempera¬ 
ture  dependence  for  T  >  550  K: 


H  =  (149) 

with 

l/s0.24eV  (150) 

for  (100)  GaAs.  The  results  for  GaAs  (111)  appear  to  follow  a  similar 
behavior. 

The  temperature  dependence  of  hardness  suggests  that  two  different 
mechanisms  may  determine  hardness  in  the  two  temperature  regimes.  At  low 
temperature,  the  hardness  is  nearly  independent  of  temperature  and  may  be 
limited  primarily  by  dislocation  interactions.  Dislocation  mobility  is  low  at 
low  temperatures,  and  the  tendency  for  dislocation  pile-up  is  high.  At 
elevated  temperature,  the  dislocation  mobility  is  increased,  so  that  disloca¬ 
tions  move  more  readily  under  an  applied  stress.  Therefore,  at  higher 
temperatures,  dislocation  pile-up  is  reduced  and  the  hardness  is  limited  by 
lattice  friction,  which  shows  a  strong  temperature  dependence. 


V.  Concluding  Remarks 

The  experimental  methods  available  to  measure  elastic  constants  vary 
greatly  in  their  accuracy  and  in  the  size  of  samples  required.  Generally,  those 
that  measure  the  velocity  of  sound  are  quite  accurate,  some  yielding  elastic 
constants  to  one  part  in  10^.  These  methods  are  also  capable  of  measurinr, 
higher-order  elastic  constants,  a  subject  not  treated  in  this  paper.  However, 
the  samples  required  for  these  measurements  must  be  large,  of  the  order  of 
several  centimeters,  and  must  be  perfect  bulk  single  crystals.  Many  semicon¬ 
ductors,  alloys  in  particular,  are  only  grown  as  thin  films  on  disparate 
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Fig.  9.  Measured  hardness  of  Ge  and  several  GaAs  samples  as  a  fimction  of  temperature. 


substrates.  For  these  examples,  the  velocity-of-sound  methods  fail,  and  the 
less  accurate  Raman  and  Brillouin  scattering  techniques  become  the  methods 
of  choice.  Their  accuracy  is  about  1  to  4%,  which  is  adequate  for  many 
practical  applications. 

Most  group  IV,  III-V  compound,  and  II- VI  compound  semiconductors 
have  been  studied,  and  their  elastic  constants  tabulated.  A  few  remain  to  be 
examined,  and  several  should  be  reexamined  because  different  experimenters 
do  not  agree  on  the  results.  The  situation  in  the  pseudobinary  alloys  is  quite 
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different.  Few  alloy  systems  have  been  adequately  studied;  those  studied  have 
mostly  fallen  into  the  class  of  materials  in  which  the  bond  lengths  of  the 
constituents  nearly  match.  More  interesting  results  are  expected  from  alloys 
with  a  bond  length  mismatch.  Such  studies  would  yield  a  wealth  of  informa¬ 
tion  on  mechanisms  responsible  for  correlations  in  these  alloys,  and  perhaps 
even  on  those  responsible  for  producing  the  ordered  alloys  that  have  been 
grown  recently. 

We  have  emphasized  the  utility  of  various  parameterized  models  for 
treating  nonideal  situations.  However,  the  most  powerful  new  theoretical 
developments  are  in  the  area  of  first-principles  theories.  The  advent  of  self- 
consistent  local  density  theories  (Hohnberg  and  Kohn,  1964)  more  than 
20  years  ago,  and  the  advances  in  methods  to  solve  the  Schroedinger 
equation,  are  making  real  applications,  as  evident  from  the  excellent  struc¬ 
tural  and  elastic  properties  produced  by  these  theories  (see  Table  II  and 
Anderson  et  al,  1985).  Recent  progress  in  the  full-potential  LMTO  method 
(Methfessel  and  van  Schilfgaarde  et  al,  1990),  in  the  non-self-consistent 
approach  (Harris,  1985;  Foules  and  Haydock,  1989;  van  Schilfgaarde  era/., 
1991),  and  the  Carr-Parrinello  (1985)  quantum  molecular  dynamics  exten¬ 
sion  has  increased  solution  speeds  into  the  realm  in  which  it  is  practical  to 
attack  many  mechanical  property  problems.  Moreover,  LDA  has  now  been 
extended  to  include  many-body  corrections  (Hybertsen  and  Louie,  1987),  so 
properties  that  are  sensitive  to  conduction  bands  can  now  be  computed 
accurately.  As  computational  speeds  continue  to  increase,  these  methods  will 
evolve  into  practical  engineering  tools. 
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Abstract 

We  calculated  excess  pair  energies  of  GaAlAs  alloys  in  the  bulk  and  on  the  surface  of  a  semi-infinite 
crystal  in  a  generalized  perturbation  method  using  a  third-neighbor  tight-binding  Hamiltonian  parameters 
derived  from  an  ab-initio  density  functional  theory.  This  Hamiltonian  has  produced  good  electronic  and 
structural  properties  for  GaAs  and  AlAs.  Calculated  excess  pair  energy  for  the  bulk  alloy  is  nearly  zero, 
suggesting  a  disordered  equilibrium  bulk  state,  and  that  for  the  Ga-Al  pair  on  the  (100)  surface  is  found 
to  be  strongly  negative.  The  cr  bonds  are  found  to  be  three  times  more  attractive  than  the  x-bonds, 
which  strongly  sug';;est  the  tendency  of  ordering  in  the  CuAu  I  structure  observed  experimentally.  From 
the  magnitude  of  this  attractive  pair  interaction  the  critical  temperature  for  the  order-disorder  transition 
is  estimated  to  be  900K. 
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Ordering  in  semiconductors  is  of  both  scientific  and  technological  interest.  The  phase 
transition  induced  by  strain,  chemical  energy  differences,  surfaces  are  of  vigorous  scientific 
study.  Properties  of  alloys  grown  for  materials  engineering  such  as  improved  band  gap. 
structural  and  electronic  properties  are  altered,  and  mostly  favorably,  by  ordering.  For 
example,  alloys  grown  for  improved  electron  mobility,  will  exhibit  further  dramatic  increase- 
due  to  ordering. 

(An  Ban::  Please  write  here  a  para  or  two  on  status  of  experimental  and 
theoretical  research  on  ordering  in  semiconductors  and  difficulties  to  explain 
ordering  GaAlAs) 

Previous  calculation  of  sublimation  energy  of  surface  atoms  from  two  extreme  surface 
coverages  concluded  that  less  energy  was  required  to  remove  a  Ga  atom  from  a  completed 
than  from  nearly  empty  (100)  GaAs  surface  (Krishnamurthy  et  al.,  1990a).  This  suggested 
a  possibility  of  repulsive  Ga-Ga  surface  interactions  with  Ga  surrounded  by  vacancies 
as  a  lower  energy  state.  The  repulsive  interactions  were  interpreted  to  be  caused  by 
charge  transfer  between  dangling  Ga  and  newly  exposed  As  bonds.  In  such  a  case,  if  Al 
atoms  are  preferred  at  vacant  sites  due  to  its  lower  dangling  bond  level  than  Ga,  then 
growth  of  ordered  GaAlAs  is  possible  by  low  temperature  growth  methods  such  as  MBE. 
A  preliminary  calculations  carried  out  with  empirical  band  structures  strongly  suggested 
such  a  possibility  (krishnamurthy  et  al.,  1991). 

The  empirical  band  structures,  even  when  they  are  carefully  constructed  for  the  cases 
studied,  are  limited  in  versatility  and  accuracy,  while  most  of  the  existing  first  principles 
calculations  are  computationally  too  demanding  to  study  realistic  problems.  The  best 
and  numerically  tractable  first  principle  theory  is  given  by  the  local  density  approxima¬ 
tion  (LDA)  (Hohenberg  and  Kohn,  1964;  Kohn  and  Sham,  1965).  Widely  used  accurate 
methods  to  solve  LDA  equations  either  by  plane  wave  method  (Ihm  and  Cohen,  1980;  Niel¬ 
son  and  Martin,  1985;  Wang  et  al.,  1989)  or  by  linearized  muffin-tin  orbitals  (Anderson, 
Jepsen,  and  Sob,  1987;  Methfessel  and  van  Schilfgaarde,  1990,  Christensen  et  al.,  1990), 
and  linearized  augmented  plane  waves  (Zunger  and  Freeman,  1977;  Mbaye,  Ferreira,  and 
Zunger,  1987)  are  discouragingly  time  consuming  for  the  approach  considered  here. 

We  use  the  linear  combination  of  atomic  orbitals  (LCAO)  method  in  a  tight-binding 
form  whose  parameters  are  obtained  from  first-principles  (Jenson  and  Sankey,  1987,  1989; 
Sankey  and  Niklewski,  1989;  Chelikowsky  and  Louie,  1984;  Vanderbilt  and  Louie,  1984). 
This  Hamiltonian  uses  pseudo-atomic-orbitals,  computed  self-consistently  from  a  free-atom 
calculation  within  LDA,  as  the  basis.  The  bulk  band  structures  and  total  energies  are 
calculated  with  an  assumption  that  the  charge  density  in  the  solid  is  the  superposition 
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of  atomic  charge  densities(Harris,  1985).  Due  to  charge  transfer  in  the  solid,  the  wave 
function  in  the  III-V  compounds  are  more  localized  than  the  corresponding  atomic  wave 
function.  Linear  combination  of  atomic  function  do  not  adequately  describe  the  wave 
function  in  the  solid.  The  basis  functions  are  made  more  compact  by  multiplying  with  a 
function  (Methfessel,1990) 


1 

(c^(r-ro)  +  1) 

and  renormalizing  it.  These  compress  parameters,  ^  and  tq  have  been  varied  to  obtain  a 
local  maximum  in  cohesive  energy.  As  a  general  rule  of  guidance,  we  start  with  twice  the 
value  of  Pauling  radius  for  tq,  and  a  value  of  2.5  for  /?.  The  obtained  electronic  structure 
and  structural  properties  such  as  equilibrium  bond  length,  cohesive  energy  per  bond,  and 
bulk  moduli  are  in  excellent  agreement  (Krishnamurthy  et  al.,  1992)  with  experiments  and 
other  calculations. 

The  studies  of  preferred  phases  during  layer-by-layer  crystal  growth  involves  minimi¬ 
zation  free  energy  with  various  parameters  such  as  alloy  concentration,  order  parameter, 
temperature  and  so  on.  Free  energy  contains  two  parts,  viz.,  enthalpy  and  entropy.  Ent¬ 
halpy  is  calculated  from  the  excess  energy  necessary  to  form  a  cluster  of  two  or  more  atoms. 
Entropy  is  obtained  from  the  number  of  ways  of  arranging  such  a  cluster.  Many  models 
are  used  to  calculate  these  cluster  energies  (Zunger,  Das  sarma,  Krishnamurthy).  In  this 
letter,  we  report  an  accurate  evaluation  of  these  energies  with  a  mathematically  rigorous 
formalism  and  first  quantitative  explanation  for  observed  ordering  in  GaAlAs  alloys. 

We  calculate  the  excess  energies  in  an  approach  known  as  generalized  perturbation  me¬ 
thod  (GPM),  which  is  computationally  straightforward  with  TB  Hamiltonian  and  is  based 
on  Green’s  function  (GF)  formalism.  This  method  can  efficiently  calculate  cluster  ener¬ 
gies  of  arbitrary  size  and  has  been  successfully  used  in  phase  diagram  studies(Ducastelle 
et  al.,  1976,Turchi  et  al.,  1987).  Its  generalization  to  use  first  principles  Hamiltonian  with 
nonorthogonal  basis  is  given  here  in  brief. 

With  orthogonal  basis  set  a  Greens  function  G  is  conventionally  defined  such  that 
{E  -f  iS  -  H)G  =  1  is  satisfied.  When  the  overlap  matrix  S  is  different  from  unit  matrix, 
the  corresponding  GF  (7^  will  satisfy  {ES  iS  -  H)G”^  =  S.  It  can  be  shown  (JAM, 
Lohez-|-Lannoo)  that  G'^  plays  the  same  role  as  conventional  G  in  the  calculation  of 
physical  quantities  such  as  density  of  states  and  total  energy.  For  example,  the  density  of 
states  and  total  density  of  states  are  respectively  Im  Tr((?”*)  and  ^  Im  Tr  (ln((j^)). 
If  we  define  an  intermediate  GF,  G^  such  that  {ES  +  i6  -  H)G^  =  1,  the  required  G"*  is 


G^S.  We  calculate  the  surface  and  bulk  G*  by  simply  replacing  (E-H)  by  (ES-H)  in  the 
difference  equation  method  (Chen  et  ah,  1989). 

The  calculation  of  excess  energies  in  GaAlAs  invariably  involves  clusters  of  Ga  and  Ah 
In  GPM,  the  reference  medium  is  random  alloy  and  most  suitable  method  of  obtaining 
band  structure  of  such  an  alloy  is  by  the  coherent  potential  approximation  (CPA).  Once  a 
CPA-alloy  is  constructed,  the  cluster,  whose  excess  energy  is  to  be  determined,  is  placed 
in  the  CPA  medium.  For  example,  if  Ga-Al  ’near-neighbor’  pair  energy  is  required,  two 
near-neighbor  CPA  atoms  are  removed  and  Ga-Al  pair  is  placed  there.  Then  the  new 
Hamiltonian  is  H  =  H  +  V  where  fj  is  CPA  Hamiltonian  and  is  the  perturbation 
potential  due  to  substitution. 

When  the  change  in  S  due  to  substitution  is  neglected,  new  GF  using  Dyson  expansion, 
is 

G*  =G^-f 

=  (l-G^V)-lG^  (1) 

multiplying  both  sides  of  Eq.(l)  by  S, 

grn  ^  ^  QkyQm 

=  {l~G^V)~^G^  (2) 

Then  the  change  electronic  energy, 

=  -  r^i/mrr(/n(G’")  -  ln{G'^))dE 

J -oo  ^ 

=  -ImTr  -  G^V)dE  (3) 

’T  J-fx> 

To  simplify  further,  we  write 

(l-e*V)  =  (l-G5V)(l-(Gij) 

(  =  V'd  -  GjK)-'  (3) 

where  G*  is  sum  of  diagonal  (G^)  and  non-diagonal  (G*^)  in  site  index.  From  Eq.  (2)  and 
(3)  and  using  the  expansion  that  In(l-x)  =  we  get  the  expression  in  the  familiar 

form. 
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AE.,,  =  l/mTr  /m(1  -  G^V)  -  i/mTrf;  / 

=  ho-^  '^Viji^  titjtf..  (4) 

ijk.. 

If  the  presence  of  both  bulk  and  surface  are  considered,  the  Hq  is  approximately  the 
difference  in  site  term  valuer  and  ic  often  a  dominant  mechanism  for  surface  segregation. 
Aq  is  zero  when  we  consider  only  surface  or  only  bulk  atoms.  All  constant  terms  can  be 
conveniently  removed  from  the  free  energy  calculations,  v^j^.  are  the  cluster  energies  that 
determine  the  ph£ise  diagrams.  Because  the  excess  energies  are  calculated  with  reference  to 
random  alloy,  just  from  the  sign  of  v^jj.  for  various  cluster  energies,  one  can  immediately 
write-down  the  phase  at  T=0  K  (Turchi  et  ah,  1987).  A  minus  sign  for  unlike-atom  pair 
energies  imply  ordering  and  a  plus  sign  for  the  same  pair  imply  segregation  and  so  on. 
However,  for  higher  temperatures  the  distribution  of  these  clusters  should  be  considered. 
The  pair  interaction  energy,  for  example,  between  Ga  and  A1  at  sites  n  and  m  respectively 
is. 


vS;-'”  =  J  (5) 

where  is  the  change  in  the  concentration  at  site  n  due  to  replacement  of  a  CPA 

atom  by  Ga  or  Ah  A  factor  of  2  should  be  included  in  the  cases  where  spin  is  not  explicitly 
considered.  It  should  be  noted  that  Eq.(4)  and  Eq.(5)  appear  much  similar  to  previously 
obtained  results  (Ducastle  et  ah,  1976).  However,  contrary  to  intuition,  conventional  G  is 
replaced  by  and  not  by  G”*. 

We  find  that  that  the  term  values  and  interaction  matrix  elements  of  surface  atoms 
are  much  different  (as  much  as  2  eV  in  some  cases)  from  the  corresponding  bulk  values. 
In  such  cases.  The  surface  GF  is  obtained  in  two  step.  First  by  assuming  bulk  term 
values  on  the  surface,  the  difference  equation  method  used  to  obtain  the  G^.  Then  the 
difference  in  term  values  are  considered  as  local  perturbation  and  the  final  GF  is  obtained 
with  Dyson’s  expansion.  With  different  self  energies  for  bulk  and  surface  atoms,  CPA 
equations  are  solved.  This  self  consistent  procedure  requires  some  generalization,  due  to 
nonorthogonality,  and  it  has  been  included.  An  important  observation  is  that  the  surface 
states  occur  in  the  gap  when  surface  atoms  are  treated  same  as  bulk  atoms.  However, 
when  the  difference  in  term  values  are  included,  the  surface  states  are  pushed  into  the 
valence  bands,  justifying  the  use  of  LDA  studies  of  surface  related  phenomena.  Incorrect 
conduction  bands  do  not  affect  the  total  energy  results. 


237 


Above  described  procedure  is  used  to  calculate  pair  energies  in  the  bulk  and  on  (100), 
(lll)B  surfaces  of  Gcio  5AIQ  5AS.  In  the  case  bulk  Goq  ^AIq  ^As,  the  pair  interaction 
between  Al-Al  and  Ga-Ga  are  attractive  and  that  between  Ga-Al  are  repulsive  with  excess 
pair  energy  of  about  1  meV.  This  tendency  towards  randomness  had  been  suggested  by 
other  calculations  (Zunger)  and  experiments(?).  Similar  calculations  are  carried  out  on 
(lll)B  surface  of  Goq  ^AIq  ^As.  The  excess  pair  energy  is  zero  within  the  accuracy  of 
our  calculations.  This  energy  has  to  be  less  than  zero  (in  our  sign  convention)  to  obtain 
ordering.  Experiments  on  (lll)B  surface  of  this  alloy  do  not  see  ordering(Kuan,  1985). 

On  the  (100)  surface,  there  are  two  kinds  of  surface  bonds.  The  surface  atoms  connec¬ 
ted  by  cr  bonds  are  also  connected  by  first  neighbor  As  on  the  layer  below.  The  tt  bonds 
connects  the  atoms  that  are  connected  also  by  third-neighbor  As  atoms  on  the  layer  be¬ 
low.  The  excess  pair  energies  which  are  nearly  zero  in  the  bulk,  reduces  substantially  and 
become  strongly  attractive  for  unlike-pairs  on  the  surface.  The  pair  energy  for  a  bonds  is 
-100  meV  and  that  for  n  bonds  -30  meV.  The  pair  energy  for  the  like  pairs  have  the  same 
magnitude  but  repulsive.  Such  an  interaction  will  yield  to  ordered  growth  where  every 
Ga(Al)  will  be  surrounded  by  Al(Ga)  in  CuAg-I  phase.  The  energy  parameter,  v  is  defined 
^  (^Ga-Al  ~  Then  using  average  value  of  the  and  a  bond  energies  and 

the  exact  expression  of  {kjg  is  Boltzmann  constant)  for  the  critical  temperature 

for  order-disorder  transition  on  a  square  lattice  (Onsagar,  1944),  we  get  a  value  of  860  K. 
This  is  in  excellent  agreement  with  experiment  where  CuAg-1  ordering  is  observed  during 
the  growth  on  (100)  direction  at  600  K  (Kuan  et  al.,  1985).  In  the  same  experiment,  no 
ordering  was  observed  on  growing  (llI)As  surface,  also  consistent  with  our  calculations. 

The  cause  for  attractive  interaction  between  unlike  pairs  is  not  transparent  from  these 
calculations  as  the  surface  states  are  dispersed  well  in  the  valence  bands.  However,  with  a 
fitted  second-neighbor  TB  Hamiltonian,  one  can  see  that  Al  dangling  states  are  lower  in 
energy  than  Ga  dangling  states.  Thus  by  charge  transfer  from  Ga  to  ’near-neighbor’  Al, 
surface  can  lower  energy,  leading  to  preference  for  Ga-Al  pairs  over  other  pairs.  However, 
the  energy  gained  should  be  larger  than  Coulomb  energy  associated  with  charge  transfer. 
For  pairs  beyond  near-neighbor  sites.  Coulomb  energy  will  be  larger  as  the  charge  transfers 
over  larger  distance.  In  addition,  in  the  absence  of  Al,  Ga- vacancy  pairs  cause  lower  energy 
as  the  charge  transfer  occur  between  Ga  and  much  deeper  As  dangling  bonds.  As  AlAs 
bonds  are  slightly  stronger  than  GaAs  bonds  (cohesive  energy  -1.84  ev  vs  -1.64  ev),  filling 
those  vacancies  with  Al,  rather  than  Ga,  will  lower  energy.  Such  advantages  are  not 
available  for  the  growth  in  (lll)B  direction,  as  the  surface  will  always  be  As  terminated. 
Our  quantitative  evaluation  of  excess  energies  and  critical  temperature  are  consistent  with 
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these  plausible  physical  mechanisms  and  explain  the  observed  ordering. 

In  conclusion,  we  carried  out  first-principles  studies  of  cluster  energies  in  bulk  and 
on  surface:,'.  The  calculated  pair  energies  on  (100)  GaAs  surface's,  indicate  a  strongly 
attractive  interaction  for  Ga-Al  pairs  and  repulsive  interaction  for  Ga-Ga  or  Al-Al  pairs. 
The  magnitude  and  sign  of  these  interactions  yield  a  critical  temperature  of  860  K  and  in 
excellent  agreement  with  experiments.  The  charge  transfer  between  the  neighboring  pairs 
arc  believed  to  cause  attracti^’e  interaction  for  unlike  atoms  leading  to  ordering. 

This  work  w<is  supported  by  ONR  contract  N00014-89-K132  and  by  NASA  contract 
NASl-18226-11.  A  part  of  the  work  was  done  while  SK  was  at  Max  Planck  Institut 
fiir  festkorperforschung,  Stuttgart.  He  thanks  Prof.  M.  Cardona  for  his  hospitality  and 
Alexander  von  Humboldt  foundation,  Bonn,  Germany  for  the  fellowship. 
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Comparison  of  ln,_JI^Sb  and  Hg,_,Cd Je  as  Long 
Wavelength  Infrared  Materials 
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Cohesive  energies,  elastic  constants,  band  structures,  and  phase  diagram  are 
calculated  to  evaluate  the  In,  ,_Tl^Sb  alloy  (ITA)  as  a  long-wavelength  infrared 
(LWIR)  material  compared  to  Hg,  ^Cd,Te  (MCT).  To  obtain  a  0.1  eV  gap  at  zero 
temperature,  the  x  value  for  ITA  is  estimated  to  be  x  =  0.083  as  compared  to 
X  =  0.222  for  MCT.  At  this  gap,  ITA  is  more  robust  than  MCT  because  the  cohesive 
energies  order  as  InSb>TlSb>CdTe>HgTe,  and  ITA  has  the  stronger  bonding 
InSb  as  the  majority  component.  Although  TlSb  is  found  to  favor  the  CsCl 
structure,  ITA  is  a  stable  alloy  in  the  zincblende  structure  for  low  x  values. 
However,  our  phase  diagram  indicates  that  it  is  difficult  to  grow  the  0. 1  eV  gap 
ITA  from  the  melt,  because  above  the  eutectic  the  liquidus  curve  is  flat,  and  the 
solidus  drops  rapidly.  Moreover,  the  width  of  the  stable  concentration  range  of 
the  zincblende  solid  phase  shrinks  at  low  temperatures  due  to  the  presence  of  the 
CsCl  structure. 

Key  words:  Elastic  constants,  InTlSb,  HgCdTe,  long-wavelength  infrared 
materials,  phase  diagram 


INTRODUCTION 

As  growth  technology  continues  to  progress,  re¬ 
searchers  are  always  in  pursuit  of  better  semicon¬ 
ductors.  In  the  area  of  long-wavelength  infrared 
(LWIR)  semiconductors,  efforts  have  been  directed 
into  three  fronts:  refinement  of  the  Hg,  ,,Cd,Te  ( MCT) 
technology,  search  for  alternative  semiconductors, 
and  use  of  superlattices  and  quantum-well  struc¬ 
tures.'  Among  alternative  LWIR  semiconductors, 
obvious  III-V  candidates  are  alloys  of  InSb  with  TlSb 
or  InBi.  An  energy-assisted  epitaxial  growth  tech¬ 
nique^  has  been  used  to  obtain  the  InSb,_^Bi,  alloy 
with  a  gap  in  the  LWIR  range  (8  to  12  p),  but  that 
method  has  never  evolved  into  a  practical  device 
technology.  Thus  it  is  interesting  to  examine  the 
prospects  of  the  In,  ,Tl,Sb  (ITA)  alloy. 

We  have  calculated'*  the  structural  properties,  band 
structures,  and  phase  diagram  to  evaluate  ITA  as  an 
infrared  (IR)  material  as  compared  to  MCT.  At  the 
same  gap'for  LWIR  application,  the  band  structure  of 
ITA  is  similar  to  that  of  MCT,  but  ITA  is  a  more  robust 


(Received  October  12,  1992;  revised  January  13,  1993) 


material.  The  phase  diagram  of  ITA  is  nore  compli¬ 
cated  than  MCT,  because  of  the  presence  of  a  stable 
CsCl  structure  in  TlSb.  Our  phase  diagram  indicates 
that  it  is  difficult  to  grow  ITA  from  the  melt.  However, 
the  zincblende  solid  solution  of  ITA  is  stable  at  low 
thallium  compositions,  which  may  permit  the  growth 
of  ITA  at  the  desired  concentration  using  epitaxial 
growth  methods.  These  studies  show  that  ITA  is  a 
potential  but  challenging  IR  material  deserving  more 
attention. 

LATTICE  CONSTANT,  COHESIVE  ENERGY, 
AND  ELASTICITY 

Table  I  shows  the  cohesive  energies,  atomic  vol¬ 
umes,  and  some  elastic  constants  of  CdTe,  HgTe, 
InSb,  and  TlSb  calculated"  from  the  full-potential 
linear  muffin-tin  orbital  method"*  (FP-LMTO)  within 
the  local  density  approximation  (LDA).  The  calcu¬ 
lated  atomic  volumes  and  elastic  constants  are  in 
good  agreement  with  available  experimental  values. 
The  calculated  cohesive  energies  are  consistently 
larger  than  the  experimental  values  by  about  0.9  eV, 
which  is  typical  in  LDA  owing  mainly  to  errors  in  the 
free  atom.  However,  the  relative  cohesive  energies 
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Table  1.  Calculated  vs  Experimental  Values  of  Atomic  Volume  (V)  ,  Cohesive  Energy  (E),  Shear 

Modulus  <c,,-c,,),  and  Bulk  Modulus  (B) 


CdTe 

HKTe 

InSb 

TISb 

ZB* 

Expt 

ZB* 

Expt 

m 

Expt 

CsCl 

W 

CsCl 

ViA'i 

33.4 

34.2 

34  2 

334 

34.1 

33.8 

25.6 

36  4 

27  1 

EieV/pr  of  atoms  1 

5.28 

4.40 

4  24 

3  24 

6  46 

5  60 

6,08 

5.70 

5  79 

c,,-c, ,  ( 10'-  erg'cm'i 

0.20 

0  17 

0.19 

0  18 

0.31 

0  31 

— 

0  17 

— 

B  1 10'-'  erg/cm ') 

0.44 

0.42 

0  44 

0  47 

0.45 

0  48 

0.72 

0  38 

0  49 

'zincblende  structure 
+C sC!  structure 


Table  II.  Direct  Band  Gaps  (in  eV)  of  Several  Semiconductors  Calculated  in  the  Local  Density 
Approximation  (LDA)  and  Comparison  with  Experimental  Gaps  (Expt) 


CdTe 

GaAs 

InP 

InAs 

InSb 

HgTe 

TISb 

LDA 

0.54 

0.44 

0.47 

-0.39 

-0  46 

-0.95 

-2.20 

LDA-SO 

0.23 

0.32 

0.42 

-0.53 

-0.73 

-1.28 

(-2.50) 

Expt 

1.60 

1.52 

1.42 

0.42 

0.26 

-0.31 

(-1.52) 

Underest. 

1.37 

1.20 

1.00 

0.95 

099 

0.97 

(0.98) 

Note:  Systematic  LDA  errors  are  used  to  deduce  the  baitdgap  (or  TISb.  The  numbers  inside  the  parentheses  are  the  estimated  values 


Fig  1  The  parametrized  band  structures  tor  InSb,  TISb.  CdTe.  and 
HgTe  along  the  (T-L)  and  A  (F-X)  directions. 


are  reliable.  Our  result  shows  that  TISb  slightly 
favors  the  CsCl  over  the  zincblende  structure  at  zero 
temperature.  In  the  zincblende  structure,  the  cohe¬ 
sive  energy  magnitudes  order  as  InSb  >  TISb  >  CdTe 
>  HgTe.  Although  TISb  has  a  weaker  shear  modulus 
than  either  HgTe  or  CdTe.  InSb  is  stronger  than  both. 
As  will  be  shown  latter,  InSb  is  the  majority  com¬ 


ponent  in  ITA  for  LWIR  applications.  These  results 
combined  imply  that  ITA  is  more  robust  structurally 
than  MCT  as  a  LWIR  material. 

BAND  STRUCTURE 

Although  LDA  is  known  to  produce  band  gaps  for 
semiconductors  that  are  too  small,  the  gap  for  TISb 
can  be  deduced  by  exploiting  the  systematic  nature  of 
LDA  errors  as  shown  in  Table  II.  The  first  row  shows 
the  band  gaps  calculated  from  LDA  for  several  direct- 
gap  semiconductors.  These  values  do  not  include  the 
spin-orbit  coupling.  The  second  row  shows  the  LDA 
values  with  the  spin-orbit  (SO)  correction  added.  The 
third  row  lists  the  available  experimental  values.  The 
fourth  row  lists  the  underestimated  values,  ranging 
from  0.95  to  1.37  eV,  in  LDA.  CdTe,  and  to  a  certain 
extent  GaAs,  has  a  larger  underestimate  owing  to  its 
smaller  dielectric  constant.  TISb  should  have  a  dielec¬ 
tric  constant  close  to  InSb  and  HgTe,  so  an  average  of 
the  underestimates  of  the  two,  i.e.  0.98  eV,  can  be  used 
for  TISb  to  obtain  an  energy  gap  of -1.52  eV. 

This  inversion  to  a  negative  gap  can  also  be  es¬ 
timated  and  understood  in  terms  of  tight-binding 
concepts.  Firstly,  due  to  a  stronger  scalar  relativistic 
potential,  the  s-level  of  thallium  is  about  1.4  eV 
deeper  than  that  of  indium,  which  lowers  the  conduc¬ 
tion  minimum  ofTlSb.  Secondly,  the  valence  d-level  of 
thallium  is  higher  than  that  of  indium  by  about  3  eV, 
and  is  only  10  eV  below  the  top  of  the  valence  band.  In 
the  crystal,  the  cation  d-level  interacts  with  the  neigh¬ 
boring  p-state  of  antimony  to  push  up  the  valence 
band  maximum  (X'BM).  This  p-d  repulsion  raises  the 
VBM  in  TISb  by  0.3  eV  higher  than  that  in  InSb. 
Starting  with  a  0.25  eV  gap  for  InSb,  we  subtract  from 
this  value  the  0.3  eV  from  the  p-d  interaction,  the  1.4 
eV  of  the  s-shift.  and  0.05  eV  due  to  a  larger  spin-orbit 
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coupling  10  arrive  at  a  gap  of  - 1  5  e\'  for  TISb  This  p- 
d  interaction  plus  the  spin-orbit  coupling  also  pro¬ 
duces  a  0.35  e\’  valence  band  offset  between  TlSb  and 
InSb.  The  same  mechanism  is  also  responsible  for  the 
band  inversion  and  valence  band  offset  between  HgTe 
and  CdTe. 

To  obtain  more  details,  we  have  used  a  hybrid 
psudopotential  tight-  binding  method  ’  to  parametrize 
the  band  structures  for  the  hosts  and  calculate  the 
band  structures  for  the  alloys.  The  band  structures 
for  the  four  hosts  along  A  ( T-L  i  and  A  ( T-X i  directions 
are  plotted  in  Fig.  1.  The  gaps  for  these  band  struc¬ 
tures  are  0.25,-1.50, 1.60,  and  -0.31  eV,  respectively, 
for  InSb,  TlSb,  CdTe,  and  HgTe.  Based  on  a  scaled- 
virtual  crystal  approximation  (SVCA),®  the  calcu¬ 
lated  band  gaps  for  both  MCT  and  ITA  at  zero  tem¬ 
perature  are  plotted  in  Fig.  2  as  a  function  of  the  alloy 
concentration  x.  The  predicted  x  values  for  the  0.1  eV 
gap  at  zero  temperature  are  x  =  0.083  for  ITA  and,  in 
close  agreement  with  experiment,  x  =  0.222  for  MCT. 
The  band  structures  in  the  vicinity  of  the  band  gap  for 
ITA  and  MCT  at  the  0. 1  eV  gap  are  compared  in  Fig. 
3.  We  can  see  these  two  alloys  have  very  similar  band 
structures  at  the  band  gap.  This  result  implies  that 
ITA  has  electrical  and  optical  properties  similar  to 
MCT  for  LWIR  applications. 

PHASE  DIAGRAM 

The  phase  diagram  of  Hg,  ,Cd,Te  is  very  simple;* 
one  in  which  the  liquidus-solidus  curves  have  a  simple 
lens  shape  and  the  zincblende  solid  solution  is  com¬ 
pletely  miscible  except  at  very  low  temperatures.  The 
presence  of  a  stable  CsCl  solid  phase  in  TlSb  consid¬ 
erably  complicates  the  phase  diagram  of  lOj  ^Tl^Sb. 
Because  of  the  small  lattice  mismatch  between  InSb 
and  TlSb,  the  free  energies  of  the  solid  solutions  in 
both  zincblende  and  CsCl  structures  can  be  approxi¬ 
mated  by  a  regular  solution  model  given  by  F(x)  = 
E(x)-t-kTlxlnx  -*■(  l-x)ln(  l-x>],  and  the  internal  en¬ 
ergy  can  be  written  as  E(x)  =  ( l-x)E(0)-i-xE( l)+x(  1- 
x)Q.  The  end  point  energies  are  given  in  Table  I.  The 
mixing  enthalpies  Q  are  calculated  within  LDA  using 
quasi-random  structures'  and  the  results^  give 


Fig.  2.  Calculated  band  gaps  as  a  function  of  alloy  concentration  x  for 
Hg,_.Cd,Te  and  ln,.,TI,Sb  at  zero  temperature. 


32  me\’  per  two  aium<  for  the  zincblende  siructuie 
and  ii, ,  =  -4  me\’  for  the  CsCl  structure  These  twu 
free  energy  functions  produce  a  large  miscibility  gap 
separating  the  zincblende  iZB'  alloy  from  the  nearl> 
pure  TlSb  crystal  in  the  C.sCl  structure  as  shown  in 
Fig.  4.  Also  shown  are  the  solidus-liquidus  curves 
calculated  from  the  regular  solution  model  In  addi¬ 
tion  to  the  above  data  for  the  solid  solution,  the 
following  data  were  adopted  for  the  liquid  phase 
mixing  enthalpy  =  -1000  caLmole,  melting  tem¬ 
peratures  of  808  and  558K,  and  enthalpies  of  melting 
of  1 1878  and  8629  cabmole  respectively  for  InSb  and 
TlSb.  The  li  value  is  taken  to  be  the  value  previously 
used  for  MCT,"  and  the  other  values  for  the  ZB  TlSb 
are  extrapolated  from  those  of  InSb'*  assuming  they 
scale  as  QiTlSbVQiInSb)  =  Q( HgTe )/Q( CdTe i  Our 
phase  diagram  shows  that  it  is  very  difficult  to  grow 
the  0.1  eV  (x  =  0.083)  ITA  from  the  melt,  because 
above  the  eutectic  the  liquidus  curv'e  is  rather  flat, 
and  the  solidus  drops  rapidly.  Moreover,  the  width  of 
the  stable  concentration  range  of  the  zincblende  solid 


Fig  3  Comparison  of  band  structures  around  the  band  gap  for 
In,  .TI_Sb  and  Hg,  ,Cd.Te  at  the  0  1  eV  gap 


InSb  X  TlSb 

Fig  4  Phase  diagram  of  in,  ,TI,Sb  alloy. 
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phase  shrinks  at  low  temperatures  due  to  the  pres¬ 
ence  of  the  CsCl  structure.  However,  the  ZB  solid  ITA 
solution  is  stable  at  low  x  values  and  moderate  tem¬ 
peratures,  A  metastable  alloy  at  the  desired  concen¬ 
tration  may  be  obtained  using  epitaxial  growth  tech¬ 
niques. 

CONCLUSION  AND  DISCUSSION 

Several  properties  of  ITA  have  been  studied.  Our 
results  show  that,  when  compared  to  MCT  as  LWIR 
material,  ITA  is  superior  in  structural  bonding,  com¬ 
parable  in  electrical  and  optical  properties,  but  more 
difficult  to  grow.  However,  our  work  represents  the 
first  reasonable  materials  evaluation.  The  predicted 
phase  diagram  depends  heavily  on  the  accuracy  of  the 
energy  difTerence  between  the  CsCl  and  the  zincblende 
structures  of  both  InSb  and  TlSb,  and  to  a  less  extent, 
on  the  other  parameters  used.  Further  studies  are 
called  for  to  test  these  suggestions  and  to  explore  the 
potential  of  this  LWIR  material. 
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lni_  ,Tl^b  is  proposed  as  promising  infrared  material.  A  number  of  optical  and  structural 
properties  are  studied  within  local  density-functional  theory.  The  alloy  at  x  =  0.09  is  estimated 
to  have  a  gap  of  0. 1  eV.  Although  TlSb  is  found  to  favor  the  CsCl  structure,  the  zinc  blende  alloy 
is  stable  for  low  x  values.  A  phase  diagram  is  calculated  to  estimate  the  regions  of  stable  phases 
and  explore  the  conditions  for  growing  narrow-gap  In|_^Tl^b  alloys. 


The  alloy  Hg,_,Cd^Te  with  x=0.22  is  currently  the 
most  widely  used  material  for  long-wave  infrared  ( LWIR, 
8-12  fim)  focal-plane  arrays  (FPA).  However, 
Hg,.,Cd,Te  is  a  weakly  bound  II-V  compound  with  ma¬ 
terials  growth  and  processing  problems'  dominated  by  na¬ 
tive  defects  that  limit  FPA  performance  and  yield.  A  stur¬ 
dier  alternative  to  Hgi_,Cdj,Te  offering  a  comparable  IR 
response  would  significantly  benefit  FPA  costs  and  perfor¬ 
mance.  Here  we  argue  that  the  III-V  alloy  Ini_j,Tl^b 
holds  much  promise  as  a  suitable,  sturdier  alternative  to 
Hg,_,Cd,Te. 

InSb,  a  III-V  zinc  blende  (ZB)  semiconductor  with  a 
band  gap  of  0. 1 8  eV  at  room  temperature,  responds  to  the 
midwave  (3-5  fim)  infrared  band  and  is  usually  preferred^ 
to  Hgo£9Cdo3|Te  with  a  comparable  band  gap,  because 
InSb  is  the  more  strongly  bound.  However,  the  band  gap  of 
InSb  is  too  wide  for  LWIR  applications.  Two  alternatives 
are  currently  under  study  to  produce  III-V  based  materials 
capable  of  LWIR  response;  strained  layer  superlattices^ 
and  multiple  quantum  well  structures.^  These  materials 
must  be  prepared  either  by  metalorganic  vapor  deposition 
(MOCVD)  or  by  molecular  beam  epitaxy  (MBE).  Both 
methods  are  expensive  and  while  advances  have  been  dem¬ 
onstrated,  it  is  not  clear  that  the  individual  device  perfor¬ 
mance,  uniformity  over  an  array,  and  material  stability  will 
better  those  of  Hgi_;,Cd,Te. 

InSb  has  the  narrowest  band  gap  of  the  ZB  semicon¬ 
ductors.  Alloying  the  Sb  with  Bi,^  or  alloying  the  In  with 
T1  can  potentially  na'^'^ow  the  band  gap.  In  its  ground  state, 
InBi  is  a  tetragonal  structure,  which  limits  concentrations 
over  which  the  equilibrium  alloy  is  miscible.*  Concentra¬ 
tions  high  enough  to  narrow  the  gap  into  the  LWIR  range 
have  been  prepared,  but  only  using  a  difficult  Ar  energy- 
assisted  epitaxial  growth  technique’  that  never  evolved  into 
a  practical  method  for  a  LWIR  material. 

Very  little  work  has  been  reported  on  the  In,_j,Tl^b 
alloy.  This  letter  is  devoted  to  calculation  of  the  materials 
properties  of  In|_;,Tl;5b  within  the  framework  of  density- 
functional  theory.  Questions  about  the  properties  of 
ln|_;,Tl^b  center  around  two  key  issues.  First,  stipulating 
that  the  alloy  assumes  a  ZB  lattice  with  either  T1  or  In 
occupying  the  cation  sites,  how  do  the  structural  and  op¬ 
tical  properties  compare  with  Hg|_,Cd^Te?  Second,  our 
calculations  show  that  TlSb  slightly  prefers  the  CsCl  lattice 


to  the  ZB.  Under  what  thermodynamic  conditions  can 
In|_j,TljjSb  be  grown  in  a  ZB  structure? 

For  the  calculations  presented  here,  except  where  oth¬ 
erwise  stated,  an  all-electron,  soft  core,  full-potential 
method  of  linear  muffin  tin  orbitals  (LMTO)  was  em- 
ployed.‘  A  basis  of  22  orbitals/atom  was  used,  including 
three  s,  three  sets  of  p,  and  two  sets  of  d  states.  This  was 
sufficient  to  converge  the  total  energy  to  an  absolute  pre¬ 
cision  of  about  1  mRy/atom.  In  all  cases,  enough  k-points 
were  used  to  converge  the  mesh  to  an  absolute  precision  of 
better  than  0.1  mRy;  this  required  between  30  and  128 
points,  depending  on  the  crystal  structure.  Thus,  the  re¬ 
sults  presented  here  should  be  very  close  to  exact  solutions 
of  the  local  density  approximation  (LDA). 

Table  I  shows  some  calculated  properties  of  CdTe, 
HgTe,  InSb,  and  TlSb.  Experimental  data  tabulated  with 
the  first  three  give  the  reader  a  sense  of  the  errors  to  be 
expected  from  these  calculations.  As  is  typical  in  the  LDA, 
the  atomic  volume  is  predicted  to  within  about  2%;  the 
cohesive  energy  is  overestimated  by  about  0.9  eV  (owing 
mainly  to  errors  in  the  free  atom;  gradient  corrections  to 
the  LDA  largely  eliminate  this  error);  and  the  band  gap  is 
underestimated  by  approximately  0.7  eV,  the  underesti¬ 
mate  larger  for  CdTe  owing  to  its  smaller  dielectric  con¬ 
stant.  Exploiting  the  systematic  trends  in  errors  for  the 
band  gap  and  cohesive  energy,  column  "Adj”  adjusts  those 
calculated  values  for  TlSb,  were  it  to  assume  a  ZB  lattice. 

Table  I  illustrates  several  points  of  interest.  TlSb  is 
lattice-matched  to  InSb  to  within  2%:  it  has  a  compara¬ 
tively  strong  bond,  considerably  stronger  than  in  HgTe.  Its 
shear  modulus  Cn  —  c,2  is  rather  small  for  a  Ill-V  material, 
possibly  indicative  of  its  preference  for  the  CsCl  structure 
(see  below).  Probably  the  most  important  point  concerns 
the  band  gaps.  The  difference  £g( CdTe )  —£g( HgTe)  =  1.9 
eV,  is  comparable  to  £g(InSb)  —  ^^(TlSb)  =  1.8  eV.  As¬ 
suming  a  linear  dependence  on  the  gap,  Hg,  _^Cd,Te  has  a 
100  meV  gap  at  x  =  0.22,  while  for  In,_,Tlj^b  the  gap 
reaches  100  meV  at  x=0.09  at  OK.  (At  3(X)  K  InSb  has  a 
gap  of  0.18  eV  and  it  reaches  100  meV  at  x=0.05.)  Thus 
the  predominant  cation  in  Hg|_;,Cd^Te  is  the  heavier, 
weakening  Hg,  while  in  In,_j,nj^b  it  is  the  lighter  and 
strengthening  In.  The  cohesive  strength  of  Ini_;,Tlj^b  will 
be  about  50%  greater  than  Hg,  _j,CdjtTe;  as  a  consequence, 
we  expect  it  to  be  far  more  stable  and  freer  of  defects. 
Because  the  material  is  more  robust,  processing  should  be 
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TABLE  1.  Calculated  and  experimental  optical  and  structural  properties  for  CdTe,  HgTe,  InSb,  and  TlSb  in  a  ZB  lattice  at  0  K.  Atonnic  volume  O  is 
in  cohesive  energy  per  cation  £  and  band  gap  £,  are  in  eV;  Ch— Cu  and  bulk  modulus  B  are  lO'^  erg/cm’  Calculated  values  for  InSb  m  the  CaO 
and  TISb  in  both  ZB  and  CsCI  lattices  are  shown.  Column  Adj  adjusts  the  calculated  values  of  TlSb  in  the  ZB  structure  for  errors  in  the  LDA 


CdTe 

HgTe 

InSb 

TISb 

ZB 

Calc 

Expt 

Calc 

Expt 

Calc 

Expt 

CsCI 

ZB 

Adj 

CsCI 

n 

33.4 

34.2 

34.2 

334 

34.1 

33.8 

25.6 

36.4 

27.1 

£ 

5.28 

4.40 

4.24 

3  24 

6.458 

560 

6.078 

5  704 

4  8 

5  792 

0.20 

0.17 

0  19 

0.18 

0.31 

0.31 

0.17 

B 

0.44 

0.42 

044 

0.47 

045 

048 

0.72 

0.38 

049 

0.54 

1.60 

-0.95 

-0.30 

-0.46 

0.26 

-2  20 

-1.5 

much  simpler.  For  example,  we  anticipate  no  significant 
problem  of  T1  loss  in  an  annealing  step. 

The  valence  band  offset  is  one  other  point  of  interest 
concerning  optical  properties  of  In|_j,Tlj4b.  A  simple  es¬ 
timate  can  be  made  in  the  context  of  the  atomic  spheres 
approximation  (ASA).  Because  the  ASA  aligns  the  bands 
with  respect  to  a  common  reference  potential  (the  Made- 
lung  potential),  it  forms  a  good  trial  potential  for  an  inter¬ 
face  between  similar  materials,  and  the  dipole  with  respect 
to  this  potential  is  generally  small.  If  the  dipole  is  zero,  the 
hand  offset  of  two  materials  is  the  difference  in  the  valence 
band  edges,  calculated  from  separate  bulk  calculations.  At 
the  lattice  parameter  of  InSb,  the  band  offset  of  TlSb/InSb 
was  calculated  to  be  0.33  eV.  As  a  comparison,  the  band 
offset  of  HgTe/CdTe  comes  out  to  0.30  eV  in  the  ASA.  In 
both  cases,  the  origin  of  the  band  offset  arises  from  cou¬ 
pling  of  the  deep  d  states  to  the  anion  p  orbitals  at  T.  The 
heavier  cation  is  shallower:  it  couples  more  strongly  and 
exerts  a  stronger  repulsion  on  the  valence  band  maximum. 

Next  we  turn  to  an  examination  of  the  Ini_,Tlj,Sb  ZB 
alloy.  Generally  speaking,  ZB  alloys  Ai_jfB,C  have  a  dis¬ 
position  to  spinoidal  decomposition  into  the  bulk  constit¬ 
uents  (AC)  (BC),.  But  when  the  AC  and  BC  lattices 
match  well  as  in  HgCdTe  and  AlGaAs,  strain  energy  from 
accommodating  bond-length  mismatch  is  absent  and  the 
energy  cost  in  forming  the  alloy  is  quite  small.  Indeed  we 
find  that  to  be  the  case  in  Ini_j,Tl;^b. 

To  show  this  for  the  In^  jTlo  jSb  alloy,  three  structures 
were  considered:  the  "CuAu”  structure,’  and  two  quasi¬ 
random  structures  proposed  by  Zunger  and  co-workers 
which  simulate  a  random  alloy  (see  Table  II  and  Ref.  9). 
In  the  ideal  lattice,  the  energies  of  the  three  structures  are 

Table  l.  structural  properties  of  three  ordered  alloys  of  Itv,,TIo5Sb. 
The  QR-4  and  QR-8  structures  are  4-  and  g-ca(ion  superlattices  taken 
from  Zunger  and  co-workers.*  Short-range  correlation  functions  of  QR-4 
and  QR-8  mimic  those  of  the  random  alloy,  and  are  named  “quasiran¬ 
dom”  structures.  VCA  is  the  virtual  crystal  average  of  this  table.  The 
energy  of  each  structure  was  minimized  with  respect  to  the  cell  volume,  f 
is  the  energy  of  the  structure  on  an  ideal  lattice:  £*  is  the  energy  when  the 
lattice  is  allowed  to  relax. 


VCA 

CuAu 

QR-4 

QR-g 

a 

6  554 

6.554 

6554 

6.554 

e 

6.082 

6.070 

6.070 

6.070 

tr 

6082 

6.074 

6074 

•Reference  9. 


found  to  be  identical  to  within  1  meV/cation.  In  the  ideal 
lattice  these  structures  are  found  to  be  12  meV  less  stable 
than  the  VCA  average;  on  allowing  the  lattice  to  relax  in 
the  CuAu  and  the  4-cation  quasirandom  structure,  this 
difference  dropped  to  8  meV/cation.  The  smallness  in  de¬ 
viation  from  the  VCA  average  and  the  independence  of  the 
energy  on  crystal  structure  suggests  that  the  effective  alloy 
Hamiltonian  is  weak  and  well  approximated  by  a  simple 
Ising  model.  The  internal  energy  of  a  random  alloy  then 
assumes  a  form 

£(x)  =  ( 1 -X)  •  £(x=0) -f  X  ■  £(x=  1 ) -(-njt(  1 -x) 

(1) 

with  fl=ft2=32  meV. 

We  now  turn  to  the  second  set  of  issues,  namely  the 
miscibility  of  T1  in  In,_,Tlj^b  in  the  ZB  structure.  This 
miscibiUty  is  limited  in  InSb  by  the  apparent  fact  that  TISb 
prefers  the  CsCI  lattice  to  the  ZB.  Table  I  shows  that,  while 
InSb  favors  the  ZB  lattice  by  380  meV/cation,  TISb  favors 
the  CsCI  lattice  by  88  meV/cation.  To  examine  this  in 


FIG.  I  Phase  diagram  of  InTlSb  along  the  ln|_,Tl,Sb  line.  The  shaded 
region  is  the  desired  solid  solution  of  In,  _,TI,Sb  in  the  ZB  structure.  The 
solid  (changing  into  dotted)  lines  demarcate  the  separation  of  the  ZB 
solid  solution  of  In , .  ,Tl,Sb  on  the  left,  a  very  narrow  solid  solution  in  the 
CsO  structure  on  the  extreme  right  (see  arrow),  and  a  two-phase  eutectic 
of  these  two  phases  (marked  "ZB -i- CsCI")  tn  the  center.  (TTiese  lines  are 
shown  as  dotted  above  the  speculated  melting  point,  as  they  are  no  longer 
meaningful  there. )  The  dashed  lines  are  our  speculated  liquidus  and  soli¬ 
dus;  the  eutectic  point  at  jt=0.8,  T=700  K  was  chosen  arbitrarily.  The 
region  marked  "ZB  -f  Liq"  is  a  two  phase  mixture  of  In ,  _  ,TI;,Sb  in  the  ZB 
phase  and  liquid.  It  is  evident  from  this  figure  that,  even  while  there  exists 
a  stable  phase  of  In, .  ,TI,Sb  in  the  ZB  structure,  the  only  accessible  phase 
from  the  liquid  is  the  eutectic  ZB -l- CsCI. 
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greater  detail,  we  have  calculated  the  total  energies  of  some 
ordered  alloys  of  In|  ,TI^b  in  the  CsCI  structure  in  ad¬ 
dition  to  those  studied  in  the  ZB  structure  (Table  1 )  As  in 
the  ZB  structure,  compounds  studied  in  the  CsCl  structure 
exhibited  a  very  small  deviation  in  the  mixing  energy  from 
the  virtual  crystal  average;  the  deviation  was  found  to  be 
slightly  negative,  but  smaller  than  in  the  ZB  structure. 
Thus  Eq.  ( 1 )  also  approximately  describes  the  x  depen¬ 
dence  of  the  internal  energy  in  the  CsCl  structure,  with 
n  =  meV.  Using  th;-se  data,  we  have  obtained 

that  section  through  the  ternary  phase  diagram  which  joins 
the  InSb  and  TlSb  binary  compositions. 

To  describe  the  miscibility  of  the  ZB  and  CsCl  struc¬ 
tures,  we  approximate  the  free  energy  of  each  structure 
with  a  regular  solution  model.  The  free  energy  of  either  ZB 
phase  is  then  f(x)  =  (l— x)£(0)-|-x:£(l)  +  ftjc(l— x) 
-(-/tr[j:ln.x+ ( 1  — x)ln(  1  — jc)),  with  fl  =  fl2  or  fl^.  By 
constructing  common  tangent  lines  between  the  two  frec- 
energy  curves  in  the  usual  way,  we  obtain  the  solid  phase 
portion  of  the  phase  diagram  that  exhibits  three  regions. 
As  Fig.  1  shows,  one  region  is  found  in  the  ZB  structure 
near  the  InSb  composition,  and  a  corresponding  region 
very  near  to  the  TlSb  composition  in  the  CsCl  structure.  A 
two-phase  eutectic  occupies  the  central  and  largest  part  of 
the  diagram. 

Also  shown  in  dotted  lines  are  the  hypothetical  solidus 
and  liquidus.  Explicit  calculation  of  this  portion  of  the 
phase  diagram  was  not  attempted  here,  particularly  since 
we  do  not  know  the  melting  point  of  TlSb.  Some  general 
observations  can  be  made,  however.  InSb  is  known  to  melt 
congruently  at  808  K,  which  fixes  the  x=0  point  of  the 
liquids.  Also  our  calculations  show  that  the  bulk  modulus 
of  nSb  in  the  CsCl  structure  is  comparable  to  InSb  in  the 
ZB,  owing  to  the  compressed  atomic  volume.  According  to 
Lindeman’s  law,  melting  takes  place  when  lattice  distor¬ 
tions  reach  about  10%  of  the  bond  length.  Crudely 
speaking,  the  vibrational  amplitudes  of  InSb(AB)  and 
TlSb(CsCl)  are  comparable,  and  thus  we  speculate  that 
the  melting  points  may  also  be  comparable.  It  is  almost 
certain,  however,  that  the  solidus  near  x=0  drops  quickly, 
owing  to  a  softening  of  the  ZB  phase  on  alloying  with  Tl. 

Fortunately,  the  most  important  feature  of  Fig.  1  does 
not  depend  in  any  major  way  in  the  uncertainty  in  the 
liquidus  and  solidus.  There  exists  below  the  melting  point 
a  stable  region  of  the  In,  _,Tl;^b  alloy  in  the  ZB  lattice  for 
Tl  concentrations  in  the  range  of  5%-15%.  On  the  basis  of 
Table  I  this  is  sufficient  to  narrow  the  band  gap  to  the  8-12 
jum  range.  Figure  1  suggests  why  this  region  may  be  diffi¬ 
cult  or  impossible  to  reach  using  liquid  phase  epitaxy.  If 
the  liquidus  is  flat  for  a  wide  concentration  range  as  de¬ 
picted  in  Fig.  1,  and  if  the  solidus  falls  off  rapidly  (as  it 


almost  certainly  does),  it  is  extremely  difficult  to  solidify  a 
crystal  in  the  ZB  lattice  from  the  liquid  phase  with  any 
significant  amounts  of  Tl  Indeed,  this  has  been  observed 
experimentally  "  Nevertheless,  a  stable  phase  could  exist 
according  to  these  calculations,  and  using  other  growth 
techniques  it  should  be  possible  to  reach  it. 

To  summarize,  our  calculations  show  that  In|  _^Tl^b 
holds  much  promise  as  a  viable,  more  robust  alternative  to 
Hgi_,Cd^Te  for  use  in  long-wave  infrared  detectors.  A 
number  of  electronic  and  mechanical  properties  were  cal¬ 
culated  for  InSb,  TlSb,  and  In|_,Tl^b  to  support  this 
argument.  Perhaps  the  most  important  finding  is  that  a 
relatively  small  amount  of  Tl  (approximately  10%)  is 
needed  to  narrow  the  band  gap  to  the  required  size.  The 
III-V  Ini_^Tl^b  is  more  robust  than  the  II-VI 
Hgi_j,Cd,Te  not  only  by  virtue  of  it  being  a  IIl-V  com¬ 
pound,  but  also  because  it  is  composed  predominantly  of 
the  lighter,  strengthening  cation.  Equally  important,  these 
calculations  show  that  there  exists  a  stable  region  in  the 
Ini_,Tl_^b  plane  of  the  phase  diagram  for  which  Tl  is 
miscible  in  InSb  in  concentrations  up  to  lS%-20%. 

Note  in  proof:  It  has  recently  come  to  our  attention" 
that  the  binary  phase  diagram  of  TlSb  is  more  complicated 
than  the  early  literature  suggests.  Nevertheless,  we  believe 
our  central  conclusion  remains  valid,  namely  that  there 
will  be  a  substantial  stable  phase  of  Ini.^Tl^^b  in  the  ZB 
structure,  since  that  conclusion  mainly  derives  from  the 
result  that  InSb  strongly  favors  the  ZB  phase  to  the  close- 
packed  one,  while  TlSb  only  weakly  favors  a  close-packed 
phase  to  the  ZB. 

This  work  was  supported  by  ONR  Contracts  N(XX)14- 
88-C-0096  and  N(X)014-89-K-0132.  We  wish  to  thank  A. 
T.  Paxton,  M.  A.  Berding,  and  J.  M.  MacCallum  for  help¬ 
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